Abstract. We adapt the proof of the Green-Tao theorem on arithmetic progressions in
primes to the setting of polynomials over a finite field, to show that for every $k$, the irreducible
polynomials in $\mathbb{F}_q[t]$ contain configurations of the form $\{f + Pg : \deg(P) < k\}$, $g \neq 0$.



1. Introduction 



In [13], Green and Tao proved the following celebrated theorem now bearing their name:

Theorem 1 (Green-Tao). The primes contain arithmetic progressions of arbitrary length.
Furthermore, the same conclusion is true for any subset of positive relative upper density of 
the primes. 



Subsequently, other variants of this theorem have been proved. Tao and Ziegler proved
the generalization for polynomial progressions $a + p_1(d), \dots, a + p_k(d)$, where $p_i \in \mathbb{Z}[x]$ and
$p_i(0) = 0$. Tao [24] proved the analog in the Gaussian integers.

It is well known that the integers and the polynomials over a finite field share a lot of similarities 
in many aspects relevant to arithmetic combinatorics. Therefore, it is natural, as Green and 
Tao did, to suggest that the analog of this theorem should hold in the setting of function 
fields: 

Conjecture 1. For any finite field F , the monic irreducible polynomials in F[t] contain affine 
spaces of arbitrarily high dimension. 



We give an affirmative answer to this conjecture. More precisely, we will prove: 

Theorem 2 (Green-Tao for function fields). Let $\mathbb{F}_q$ be the finite field with $q$ elements. Then
for any $k > 0$, we can find polynomials $f, g \in \mathbb{F}_q[t]$, $g \neq 0$, such that the polynomials $f + Pg$,
where $P$ runs over all polynomials $P \in \mathbb{F}_q[t]$ of degree less than $k$, are all irreducible.
Furthermore, such configurations can be found in any set of positive relative upper density
among the irreducible polynomials.

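Here we define the upper density of a set $A \subset \mathbb{F}_q[t]$ to be
$$\bar{d}(A) = \limsup_{N\to\infty} \frac{\#\{f \in A : \deg(f) < N\}}{q^N},$$
and the relative upper density of $A$ in the set $\mathcal{P}$ of all irreducible polynomials to be
$$\bar{d}_{\mathcal{P}}(A) = \limsup_{N\to\infty} \frac{\#\{f \in A : \deg(f) < N\}}{\#\{f \in \mathcal{P} : \deg(f) < N\}}.$$
The conjecture then follows since the monic polynomials have positive density among all the polynomials.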



Our arguments follow those of Green and Tao very closely. We have also chosen to incorporate some
modifications that considerably simplify some major steps in the original arguments. Therefore,
the paper may prove to be helpful to those who want to understand the ideas of the proof of
the Green-Tao theorem.

Acknowledgements. I am grateful to my advisor Terence Tao for suggesting this project to me,
and for frequent consultation and assistance throughout the preparation of this paper.



2. Outline of the proof and notation 

2.1. Notation. Throughout the paper, we will be working with a fixed field $\mathbb{F}_q$ on $q$ elements,
where $q$ is a prime power. Let $\mathbb{F}_q[t]$ be the ring of polynomials with coefficients in $\mathbb{F}_q$. Let
$\mathbb{F}_q(t)$ be the field of fractions of $\mathbb{F}_q[t]$, i.e. $\mathbb{F}_q(t) = \{ f/g : f, g \in \mathbb{F}_q[t],\ g \neq 0 \}$. The value of $k$ will be
kept fixed. We will keep our notation consistent with that of Green and Tao.

It is clear that it suffices to find affine spaces in the irreducible polynomials of degree smaller
than $N$, where $N$ is sufficiently large. Denote by $G_N$ the set of all polynomials in $\mathbb{F}_q[t]$ of
degree less than $N$. A priori, $G_N$ is an additive group. We also, for each $N$, fix a monic
irreducible polynomial $f_N \in \mathbb{F}_q[t]$ of degree $N$. Then the additive group $G_N$ can be endowed with a field
structure isomorphic to $\mathbb{F}_{q^N}$, the field on $q^N$ elements, via multiplication modulo $f_N$. The
need for the field structure arises in the same way as when we convert $\{1, \dots, N\}$ into $\mathbb{Z}_N$,
the main reason being that we can freely perform divisions. Of course, there is a price to pay,
namely the "wraparound" effect, which arises when we want to pull things back from $\mathbb{F}_{q^N}$ to
$G_N$, but this is easy to deal with. In the setting of $\mathbb{F}_q[t]$ this is even simpler, since addition of
polynomials does not increase the maximum of the degrees, in contrast to the integers.
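
To make the field structure on $G_N$ concrete, here is a minimal Python sketch for the special case $q = 2$. It is an illustration only: the encoding of polynomials as integer bitmasks, the names `mul_mod` and `inverse`, and the choice of modulus $t^3 + t + 1$ are assumptions made for this example and are not taken from the paper.

```python
# Polynomials over F_2 are encoded as integers: bit i is the coefficient of t^i.
# This encoding and the modulus t^3 + t + 1 are illustrative assumptions.

def mul_mod(a, b, modulus, n):
    """Multiply a, b in G_n (degree < n) modulo an irreducible of degree n."""
    result = 0
    while b:
        if b & 1:
            result ^= a          # addition in F_2[t] is XOR
        b >>= 1
        a <<= 1                  # multiply a by t
        if (a >> n) & 1:
            a ^= modulus         # reduce as soon as the degree reaches n
    return result

N = 3
F_N = 0b1011                     # t^3 + t + 1, irreducible over F_2

def inverse(a):
    """Brute-force inverse of a nonzero element of the field with 2^N elements."""
    return next(b for b in range(1, 1 << N) if mul_mod(a, b, F_N, N) == 1)
```

Addition (XOR) never leaves $G_N$, while multiplication needs the reduction step; that reduction is the source of the wraparound effect mentioned above.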

If $\phi$ is a function on a finite set $A$, we write $\mathbb{E}(\phi(x) \mid x \in A)$, or $\mathbb{E}_{x \in A}\phi(x)$, or simply $\mathbb{E}_A\phi$ to
denote the expectation of $\phi$ on $A$, in other words the average value of $\phi$ on $A$. We denote the
inner product of two functions $\phi, \psi$ on $A$ as $\langle \phi, \psi \rangle = \mathbb{E}_{x \in A}\phi(x)\psi(x)$. We will also define the
$L^p$-norm of $\phi$ to be $\|\phi\|_p = \mathbb{E}(|\phi(x)|^p \mid x \in A)^{1/p}$, and the $L^\infty$-norm to be $\|\phi\|_\infty = \sup_{x \in A}|\phi(x)|$.

Let $K = |G_k| = q^k$, the number of polynomials of degree less than $k$.

For a non-zero polynomial $f \in \mathbb{F}_q[t]$ define the norm of $f$ to be $|f| = q^{\deg(f)}$. Also, let $|0| = 0$.
Then the norm $|\cdot|$ defines a distance on $\mathbb{F}_q[t]$. Often, when dealing with the wraparound
effect, we will make use of cylinder sets. A cylinder set of radius $r$ is simply the set of all
$f \in \mathbb{F}_q[t]$ whose distance to a given point is at most $r$. The cylinder sets are the analog of
intervals in $\mathbb{R}$, but they enjoy the more pleasant property that for any two cylinders, either they
are disjoint or one is contained in the other.
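
The nesting property of cylinders can be checked by brute force in a small case. The sketch below assumes $q = 2$ and encodes polynomials as integer bitmasks (bit $i$ is the coefficient of $t^i$); the function names are hypothetical and chosen only for this illustration.

```python
Q = 2  # assumption: work over F_2, polynomials encoded as bitmasks

def norm(f):
    """|f| = Q^deg(f) for f != 0, and |0| = 0."""
    return 0 if f == 0 else Q ** (f.bit_length() - 1)

def cylinder(center, radius, n):
    """All h in G_n with |h - center| <= radius (subtraction is XOR over F_2)."""
    return frozenset(h for h in range(1 << n) if norm(h ^ center) <= radius)

def nested_or_disjoint(n, radii):
    """Check that any two cylinders in G_n are either nested or disjoint."""
    cyls = {cylinder(c, r, n) for c in range(1 << n) for r in radii}
    return all(a <= b or b <= a or not (a & b) for a in cyls for b in cyls)
```

The check succeeds because the distance is an ultrametric: $|f + g| \le \max(|f|, |g|)$.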

Let us call a set $\{f + Pg : P \in G_k\}$ a $k$-configuration. If $g \neq 0$ then it is called a non-trivial
$k$-configuration. A $k$-configuration in $G_N$ is necessarily a $k$-configuration in $\mathbb{F}_{q^N}$, but not vice
versa.
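
As a small worked example of a $k$-configuration, the following Python sketch enumerates $\{f + Pg : \deg P < k\}$ over $\mathbb{F}_2$. The bitmask encoding and the helper names `clmul` and `k_configuration` are assumptions made for illustration.

```python
def clmul(a, b):
    """Carry-less product: multiplication in F_2[t] on bitmask encodings."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        b >>= 1
    return result

def k_configuration(f, g, k):
    """The set {f + P*g : deg P < k} over F_2 (addition is XOR)."""
    return {f ^ clmul(P, g) for P in range(1 << k)}

# f = 1, g = t, k = 2: the configuration {1, 1 + t, 1 + t^2, 1 + t + t^2}
config = k_configuration(0b1, 0b10, 2)
```

For $g \neq 0$ the configuration has exactly $K = q^k$ elements and is an affine subspace over $\mathbb{F}_q$; for $g = 0$ it collapses to the single point $f$.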

For two quantities $A, B$, we write $A = O(B)$, or $A \ll B$, or $B \gg A$ if there is an absolute
constant $C$ such that $|A| \le CB$. If $A$ and $B$ are functions of the same variable $x$, we write
$A = o_{x\to\infty}(B)$ if $A/B$ tends to $0$ as $x$ tends to infinity. If the constant $C$ (respectively, the
rate of convergence of $A/B$) depends on a parameter, e.g. $m$, then we write $A = O_m(B)$
(respectively, $A = o_{m; x\to\infty}(B)$). Dependence on fixed quantities such as $q$ or $k$ will often be
omitted. Most of the time we will be dealing with functions in $N$, and when it is clear we will
remove it from the notation. Thus $O(1)$ stands for a bounded quantity (independent of $N$)
and $o(1)$ stands for a function that goes to $0$ as $N$ tends to infinity.



2.2. Outline of the proof. The starting point of Green-Tao is Szemerédi's theorem, which
states that any set of positive density among the natural numbers contains arbitrarily long
arithmetic progressions. Actually, they needed a stronger form of Szemerédi's theorem, obtained
by incorporating an argument known as Varnavides's trick [27]. In the setting of
function fields, an analog of Szemeredi's theorem is readily available [1], [l]. Coupled with the 
Varnavides argument, this gives the following result, which we will prove in Section 3:

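Theorem 3 (Szemerédi for function fields). For every $\delta > 0$, there exists a constant $c(\delta) > 0$
such that, for every function $\phi : \mathbb{F}_{q^N} \to \mathbb{R}$ such that $0 \le \phi(x) \le 1$ for all $x$ and $\mathbb{E}(\phi \mid \mathbb{F}_{q^N}) \ge \delta$,
we have

$$\mathbb{E}\Big( \prod_{P \in G_k} \phi(f + Pg) \,\Big|\, f, g \in \mathbb{F}_{q^N} \Big) \ge c(\delta).$$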



Following Green and Tao, our next step is a transference principle, which allows us to generalize
Szemerédi's theorem to larger classes of $\phi$, whose functions are not necessarily bounded. Let
us call a measure a function $\nu : G_N \to \mathbb{R}$. A pseudorandom measure is a measure satisfying
two technical conditions (to be defined later in Section 4), called the linear forms condition
and the correlation condition.

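Theorem 4 (Green-Tao-Szemerédi for function fields). Let $\nu : \mathbb{F}_{q^N} \to \mathbb{R}$ be a pseudorandom measure.
Then for every $\delta > 0$, there exists a constant $c'(\delta) > 0$ such that, for every function
$\phi : \mathbb{F}_{q^N} \to \mathbb{R}$ such that $0 \le \phi(f) \le \nu(f)$ for all $f$ and $\mathbb{E}(\phi \mid \mathbb{F}_{q^N}) \ge \delta$, we have

$$\mathbb{E}\Big( \prod_{P \in G_k} \phi(f + Pg) \,\Big|\, f, g \in \mathbb{F}_{q^N} \Big) \ge c'(\delta) - o(1).$$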



This is obtained by means of a decomposition result, namely any function $\phi$ bounded by
a pseudorandom measure can be decomposed as $\phi = \phi_1 + \phi_2$, where $\phi_1$ is a nonnegative,
bounded function, whose average is bounded from below, and $\phi_2$ is uniform in the sense that
it is small in a norm (the Gowers norm, to be defined later) that is relevant to counting
$k$-configurations. Thus the contribution of $\phi_2$ in $\mathbb{E}\big( \prod_{P \in G_k} \phi(f + Pg) \mid f, g \in \mathbb{F}_{q^N} \big)$ is small, so
that the latter is close to $\mathbb{E}\big( \prod_{P \in G_k} \phi_1(f + Pg) \mid f, g \in \mathbb{F}_{q^N} \big)$, which is bounded from below by
the usual Szemerédi's theorem. The proof of the decomposition result in Green-Tao [13] and
later in [26] is quite involved. Recently Gowers [8] and Reingold-Trevisan-Tulsiani-Vadhan
[19], [20] have found much simpler proofs of this result, the main tool being the Hahn-Banach
theorem. Moreover, their formulations of the result are very general and directly applicable
to our setting of function fields.

Once Theorem 3 is established, the final step is to show that $\nu$ can be constructed in such a way
that $\nu$ majorizes functions supported on irreducible polynomials, such as (variants of) the von
Mangoldt function $\Lambda$¹, where

$$\Lambda(f) = \begin{cases} \deg(P), & \text{if } f = cP^m, \text{ where } P \text{ is irreducible and } c \in \mathbb{F}_q, \\ 0, & \text{otherwise.} \end{cases}$$

To this end, we will make adaptations of the truncated divisor sum of Goldston and Yildirim
from their work on short gaps between primes [5], [6] in Section 9.
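
The von Mangoldt function can be computed by brute force in small cases. The following Python sketch does so over $\mathbb{F}_2$, where the only non-zero constant is $c = 1$; the bitmask encoding (bit $i$ is the coefficient of $t^i$) and the function names are assumptions made for illustration, and the trial-division approach is chosen for clarity rather than efficiency.

```python
def deg(f):
    """Degree of a non-zero polynomial over F_2 given as a bitmask."""
    return f.bit_length() - 1

def poly_divmod(a, b):
    """Quotient and remainder of a by b (b != 0) in F_2[t]."""
    quotient = 0
    while a and deg(a) >= deg(b):
        shift = deg(a) - deg(b)
        quotient ^= 1 << shift
        a ^= b << shift
    return quotient, a

def von_mangoldt(f):
    """deg(P) if f is a power of an irreducible P, and 0 otherwise."""
    if f <= 1:
        return 0                      # zero and the constant 1
    # The non-constant divisor with the smallest encoding is irreducible.
    for p in range(2, f + 1):
        if poly_divmod(f, p)[1] == 0:
            break
    while f != 1:
        f, remainder = poly_divmod(f, p)
        if remainder:
            return 0                  # a second irreducible factor is present
    return deg(p)
```

A useful sanity check is the identity $\sum_{d \mid f} \Lambda(d) = \deg(f)$, the sum being over monic divisors, which mirrors the classical identity $\sum_{d \mid n} \Lambda(d) = \log n$.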

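Theorem 5 (Goldston-Yildirim for function fields). For any $A \subset \mathcal{P}$ such that $\bar{d}_{\mathcal{P}}(A) > 0$,
there exist a constant $\delta > 0$, a pseudorandom measure $\nu : \mathbb{F}_{q^N} \to \mathbb{R}$, a function $\phi : \mathbb{F}_{q^N} \to \mathbb{R}$
and $W, b \in \mathbb{F}_{q^N}$ such that the following are true for infinitely many $N$:

(1) $\phi$ is $0$ outside of $\{h \in G_N : Wh + b \in A\}$.

(2) $0 \le \phi \le \nu$.

(3) $\mathbb{E}(\phi \mid \mathbb{F}_{q^N}) \ge \delta$.

(4) $\|\phi\|_\infty \ll N$.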

Remarks 2.3. The introduction of $W$, known as the "W-trick", is quite common in this situation
in arithmetic combinatorics, when we want to transfer results about dense sets to the
primes. We will need the irreducible polynomials to be distributed sufficiently uniformly in
congruence classes, and for this purpose, we will take $W$ to be a product of small irreducible
polynomials. The value of $b$ is chosen by the pigeonhole principle, so that the residue class of
$b$ modulo $W$ occupies a large proportion of $A$.



Proof of Theorem 2 using Theorem 4 and Theorem 5. Suppose $N$ is such that the conclusions
of Theorem 5 hold. We partition $G_N$ into disjoint cylinders $C_i$ of radius $q^{N-k}$, so that
$|f_1 - f_2| < q^{N-k}$ for any two polynomials $f_1, f_2$ in the same cylinder. There must be a cylinder
$C_i$ on which the average of $\phi$ is at least $\delta$. Let $\psi = \phi 1_{C_i}$. Applying Theorem 4 to the function
$\psi$, we have $\mathbb{E}\big( \prod_{P \in G_k} \psi(f + Pg) \mid f, g \in \mathbb{F}_{q^N} \big) \ge c'(\delta/q^k) - o(1)$. Because of the bound on the
magnitude of $\phi$, the contribution of the products corresponding to trivial $k$-configurations
is $o(1)$. Thus for $N$ sufficiently large, $\psi$ is non-zero on some non-trivial $k$-configuration
$\{f + Pg \mid P \in G_k\} \subset \mathbb{F}_{q^N}$. A priori, this is a $k$-configuration in $\mathbb{F}_{q^N}$. Because of the definition
of $\psi$, $f + Pg \in C_i$ for every $P \in G_k$. In particular $q^{N-k} > |(f + g) - f| = |g|$, so that the above
$k$-configuration is indeed a $k$-configuration in $G_N$. Thus $\{W(f + Pg) + b \mid P \in G_k\}$ is a non-trivial
$k$-configuration that lies entirely in $A$, since $\phi$ is supported in $\{h \in G_N : Wh + b \in A\}$. □

Remarks 2.4. The techniques here not only give infinitely many $k$-configurations, but also
show that the number of such configurations is $\gg \frac{q^{2N}}{N^K}$, which is of the correct magnitude in the
context of the Hardy-Littlewood conjecture on tuples of primes. We remark that while more
algebraic methods can generate configurations of irreducibles, e.g. the analog of the twin prime
conjecture (p£i Section 1.10]), such methods don't give the correct bound (up to a constant). 



The next sections are organized as follows. In Section 3 we establish Theorem 3. In Section
4 we define pseudorandom measures. Next, in Sections 5 and 6 we introduce the Gowers norms,
dual functions and their properties, which are necessary in our proof of the decomposition
result in Section 7. In Section 8 we introduce arithmetic functions in $\mathbb{F}_q[t]$. We give in
Section 9 the construction of a measure $\nu$ that majorizes the irreducible polynomials.
Sections 10 and 11 will be devoted to establishing the pseudorandomness of $\nu$, thus finishing
our proof of Theorem 2.

¹ Ideally, we would like to take $\nu = \Lambda$, but to verify that $\Lambda$ satisfies the conditions of a pseudorandom measure
we need more information about additive properties of the irreducible polynomials, which would in turn lead
to statements analogous to the twin prime conjecture. Although the analog of the twin prime conjecture in $\mathbb{F}_q[t]$
is known to be true, the conjectured asymptotic formula is not yet proven.



3. Szemerédi's theorem in function fields

As mentioned above, we need an analog of Szemerédi's theorem in $\mathbb{F}_q[t]$, namely that we can
find non-trivial $k$-configurations inside any subset of positive upper density of $\mathbb{F}_q[t]$:

Proposition 1. Let $\delta > 0$. Then for $N$ sufficiently large, $N \ge N_0 = N_0(q, k, \delta)$, in every
subset $A$ of size $\delta q^N$ of $G_N$, we can find polynomials $f, g \in \mathbb{F}_q[t]$, $g \neq 0$, such that $f + Pg \in A$
for every polynomial $P \in G_k$.

There are at least two ways to see this. It is an immediate consequence of a far more general
result of Bergelson-Leibman-McCutcheon [1]:

Theorem 6 (Polynomial Szemerédi for countable integral domains). Let $K$ be a countable
integral domain, $M$ be a finitely generated $K$-module, $p_1, \dots, p_n$ be polynomials $K \to M$ such
that $p_i(0) = 0$ for every $i$. Then for any set $A \subset M$ of upper Banach density $d^*(A) > 0$, there
exist $d \in K$, $d \neq 0$, and $a \in M$ such that $a + p_i(d) \in A$ for every $i = 1, \dots, n$.

When $K = M = \mathbb{F}_q[t]$ and $p_1, \dots, p_n$ are the linear polynomials $g \mapsto Pg$, where $P \in G_k$, we
have the desired result.

Proposition 1 can also be proved using the density Hales-Jewett theorem. Before stating it,
we need some definitions:

Definition 1. Given a set $A = \{a_1, a_2, \dots, a_t\}$, the $n$-dimensional combinatorial space on $A$ is
$C_t^n = \{(x_1, x_2, \dots, x_n) : x_i \in A \text{ for } i = 1, 2, \dots, n\}$. A combinatorial line in $C_t^n$ is a collection
of $t$ points $x^{(1)}, x^{(2)}, \dots, x^{(t)} \in C_t^n$ such that for some subset $I \neq \emptyset$ of $\{1, \dots, n\}$ (the "active"
coordinates), and fixed elements $(b_i)_{i \notin I}$, we have

$$x^{(j)}_i = \begin{cases} a_j, & \text{if } i \in I, \\ b_i, & \text{if } i \notin I, \end{cases}$$

for any $j = 1, \dots, t$.

We can think of a combinatorial space as the set of all words of length $n$ on $t$ letters. For
example, if $A = \{a, b, c\}$ then $\{axbx : x = a, b, c\}$ is a combinatorial line in the 4-dimensional
combinatorial space on $A$. Note that a combinatorial line is a stronger notion than a geometric
line in that it "looks like" a line in every reordering of the underlying set $A$.

The classical Hales-Jewett theorem [14], [10] says

Theorem 7 (Hales-Jewett). For every $r, t$, there is a number $HJ(r, t)$ such that if $n \ge
HJ(r, t)$ and the points of the $n$-dimensional combinatorial space on $t$ elements are colored by
$r$ colors, then there exists a monochromatic combinatorial line.

The density Hales-Jewett theorem says we can always locate such a line in the most used color.
It is to the classical Hales-Jewett theorem what Szemerédi's theorem is to van der Waerden's
theorem. The only known proof is due to Furstenberg and Katznelson [3] and uses ergodic
theory.

Theorem 8 (Density Hales-Jewett). For every $\delta > 0$, $t \in \mathbb{Z}^+$, there is a number $HJ(\delta, t)$ such
that if $n \ge HJ(\delta, t)$, in every subset of size $\delta t^n$ of the $n$-dimensional combinatorial space on $t$ elements,
there is a combinatorial line which lies entirely in this subset.

Proof of Proposition 1 using the density Hales-Jewett theorem. We consider the set $X$ of all
$k$-tuples $(a_0, a_1, \dots, a_{k-1})$ where $a_j \in \mathbb{F}_q$. Its cardinality is $K = q^k$. Let $N_0 = HJ(\delta, K)$; then
$G_{kN_0}$ can be identified with the combinatorial space of dimension $N_0$ over $X$, by identifying
a polynomial with its $kN_0$ coefficients, divided into $N_0$ blocks of length $k$. By the definition
of $N_0$, if we choose $\delta q^{kN_0}$ points out of $G_{kN_0}$, there must exist a combinatorial line among them. It is easy
to see that a combinatorial line in this space corresponds to a non-trivial $k$-configuration in
$\mathbb{F}_q[t]$. Clearly if the statement is true for $N = kN_0$, then it is true for all $N \ge kN_0$, since some translate of $G_{kN_0}$ inside $G_N$ meets $A$ in density at least $\delta$. □
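
The correspondence between combinatorial lines and $k$-configurations used in this proof can be made explicit in a small case. The Python sketch below assumes $q = 2$ and encodes polynomials as bitmasks, with block $i$ holding the coefficients of $t^{ki}, \dots, t^{ki+k-1}$; all function names are hypothetical. A line with active blocks $I$ corresponds to $f + Pg$ with $g = \sum_{i \in I} t^{ki}$ and $f$ given by the fixed blocks.

```python
def clmul(a, b):
    """Multiplication in F_2[t] on bitmask encodings."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        b >>= 1
    return result

def line_points(active, fixed, k, n_blocks):
    """Points of the combinatorial line, read as polynomials of degree < k*n_blocks."""
    points = set()
    for letter in range(1 << k):              # a letter is a block of k coefficients
        value = 0
        for i in range(n_blocks):
            block = letter if i in active else fixed[i]
            value |= block << (k * i)
        points.add(value)
    return points

def as_configuration(active, fixed, k, n_blocks):
    """The pair (f, g) with line = {f + P*g : deg P < k}."""
    f = sum(fixed[i] << (k * i) for i in range(n_blocks) if i not in active)
    g = sum(1 << (k * i) for i in active)
    return f, g

k, n_blocks = 2, 3
active, fixed = {0, 2}, {1: 0b11}
f, g = as_configuration(active, fixed, k, n_blocks)
configuration = {f ^ clmul(P, g) for P in range(1 << k)}
```

Since the active set is non-empty, $g \neq 0$, so the resulting $k$-configuration is non-trivial.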

A Varnavides argument shows that not only is there such a $k$-configuration, but there are in
fact many of them:

Proposition 2. Let $\delta > 0$. Then there is a constant $c(\delta) > 0$ such that, for $N$ sufficiently
large, in every subset $A$ of size $\delta q^N$ of $G_N$, we can find at least $c(\delta)q^{2N}$ $k$-configurations.

Proof. Let $m = N_0(q, k, \delta/2)$, where $N_0$ is the function in Proposition 1, and suppose $N \ge m$.
We claim that among the $q^{2N-2m}$ $m$-configurations in $G_N$, there are at least $\frac{\delta}{2}q^{2N-2m}$ of them
on which the density of $A$ is at least $\delta/2$.

Indeed, let us count the number of pairs $(V, h)$ where $V$ is an $m$-configuration in $G_N$ and
$h \in A \cap V$. On the one hand, since for every given point in $G_N$ there are $q^{N-m}$ $m$-configurations
in $G_N$ containing it (why?), the number of such pairs is $\delta q^N q^{N-m} = \delta q^{2N-m}$. On the other hand, the
$m$-configurations $V$ on which the density of $A$ is at most $\delta/2$ contribute at most $\frac{\delta}{2}q^m q^{2N-2m} =
\frac{\delta}{2}q^{2N-m}$ pairs in total. Thus the contribution of those $V$ on which the density of $A$ is at least $\delta/2$ is
at least $\delta q^{2N-m} - \frac{\delta}{2}q^{2N-m} = \frac{\delta}{2}q^{2N-m}$. Since each $m$-configuration contains at most $q^m$ points, the number of $m$-configurations $V$ on which
the density of $A$ is $\ge \delta/2$ is at least $\frac{\delta}{2}q^{2N-2m}$.

From the definition of $m$, each such $m$-configuration (with the exception of at most $q^N$ trivial
$m$-configurations) contains a non-trivial $k$-configuration. The number of times a given
$k$-configuration is counted is at most $q^{m-k}$ (why?). Thus the number of non-trivial
$k$-configurations in $G_N$ is at least $q^{k-m}\big(\frac{\delta}{2}q^{2N-2m} - q^N\big) \gg q^{2N}$, as desired. □

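From this Theorem 3 easily follows.

Proof of Theorem 3. Suppose $\mathbb{E}(\phi \mid G_N) \ge \delta$. Let $B \subset G_N$ be the set on which $\phi$ is $\ge \delta/2$.
Then $|B| + \frac{\delta}{2}(q^N - |B|) \ge \delta q^N$, so that $|B| \ge \frac{\delta}{2}q^N$. Proposition 2 implies that $B$ contains at
least $c(\frac{\delta}{2})q^{2N}$ $k$-configurations. Thus $\mathbb{E}\big( \prod_{P \in G_k} \phi(f + Pg) \mid f, g \in \mathbb{F}_{q^N} \big) \ge (\frac{\delta}{2})^K c(\frac{\delta}{2})$. □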

Remark 3.1. In contrast with the usual Szemerédi theorem for the integers, where we have a
quantitative proof due to Gowers [7], the proof of the Bergelson-Leibman-McCutcheon
theorem uses ergodic theory and therefore does not give any bound for $c(\delta)$ in terms of $\delta$.
As for the density Hales-Jewett theorem, just until very
recently, Gowers et. al. [T^ announced to have found combinatorial proofs, from which some 
bounds might be extracted, but still far weaker than what is known for the integers. Thus 
we don't seek to find a bound for the first occurrence in the irreducible polynomials of the
configurations $\{f + Pg \mid P \in G_k\}$.

4. Pseudorandom measures 

A measure² is a function $\nu : \mathbb{F}_{q^N} \to [0, \infty)$. Strictly speaking, $\nu\mu$ should be called a measure
rather than $\nu$, where $\mu$ is the normalized counting measure on $G_N$ (and therefore we
should think of $\nu$ as the Radon-Nikodym derivative of a random measure with respect to the
normalized counting measure). However, we keep this naming to be consistent with [13]. A
pseudorandom measure is a measure satisfying the two conditions defined below:

Definition 2 (Linear forms condition). We say that a measure $\nu : \mathbb{F}_{q^N} \to [0, \infty)$ satisfies the
$(m_0, n_0, k_0)$-linear forms condition if whenever we have $m \le m_0$ linear forms in $n \le n_0$
variables $\psi_1, \dots, \psi_m : (\mathbb{F}_{q^N})^n \to \mathbb{F}_{q^N}$ of the form

$$\psi_i(\mathbf{f}) = \sum_{j=1}^{n} L_{ij} f_j + b_i$$

such that all the coefficients $L_{ij}$ are in the set³ $\{f/g \mid f, g \in G_{k_0},\ g \neq 0\}$, and no two of the vectors
$(L_{ij})_{1 \le j \le n}$, $i = 1, \dots, m$, are proportional, then we have

$$\mathbb{E}\big( \nu(\psi_1(\mathbf{f})) \cdots \nu(\psi_m(\mathbf{f})) \mid \mathbf{f} \in (\mathbb{F}_{q^N})^n \big) = 1 + o_{N\to\infty}(1). \qquad (1)$$

In particular, if $\nu$ satisfies the linear forms condition then $\mathbb{E}(\nu(f) \mid f \in G_N) = 1 + o(1)$. Note that
we require the $o_{N\to\infty}(1)$ to be uniform in all choices of $b_1, \dots, b_m \in \mathbb{F}_{q^N}$.

Definition 3 (Correlation condition). We say that a measure $\nu : \mathbb{F}_{q^N} \to [0, \infty)$ satisfies the
$l_0$-correlation condition if whenever we have $l \le l_0$ linear forms of the form $f + h_1, \dots, f + h_l$
with $h_1, h_2, \dots, h_l \in \mathbb{F}_{q^N}$, then

$$\mathbb{E}\big( \nu(f + h_1) \cdots \nu(f + h_l) \mid f \in \mathbb{F}_{q^N} \big) \le \sum_{1 \le i < j \le l} \tau(h_i - h_j) \qquad (2)$$

where $\tau$ is a function $G_N \to \mathbb{R}^+$ having the property that $\mathbb{E}(\tau(f)^p \mid f \in \mathbb{F}_{q^N}) = O_p(1)$ for
every $p > 1$.

The point is that the function $\tau$ is not necessarily bounded as $N$ tends to infinity, but its
$L^p$-norm is always bounded.

Definition 4 (Pseudorandom measures). A measure $\nu : \mathbb{F}_{q^N} \to [0, \infty)$ is called $k$-pseudorandom
if it satisfies the $(K2^{K-1}, 3K - 4, k)$-linear forms condition and the $2^{K-1}$-correlation condition
(recall that $K = q^k$).

² More precisely, it is a family $\{\nu_N\}_{N \in \mathbb{Z}^+}$ such that for each $N$, $\nu_N$ is a function $\mathbb{F}_{q^N} \to \mathbb{R}$.
³ Note that this set can be embedded into $\mathbb{F}_{q^N}$ for every $N > k_0$. Of course, the embedding depends on our
choice of the irreducible polynomial $f_N$ underlying $\mathbb{F}_{q^N}$.
Remark 4.1. The exact values of the parameters $m_0, n_0, k_0, l_0$ are not important, since we will
see that for any $(m_0, n_0, k_0, l_0)$, we can find a measure that satisfies the $(m_0, n_0, k_0)$-linear forms
condition and the $l_0$-correlation condition. However, it is essential in the construction that
these values are finite.

From now on we will refer to $k$-pseudorandom measures as pseudorandom measures.

Lemma 1. If $\nu$ is pseudorandom, then so is $\nu_{1/2} = (\nu + 1)/2$. More generally, for any
$0 < \alpha < 1$, $\nu_\alpha = (1 - \alpha)\nu + \alpha$ is also pseudorandom.

In practice we will be dealing with $\nu_{1/2}$ and $\nu_{1/4}$.

Proof. Suppose we want to verify the condition (1) for $\nu_{1/2}$. If we expand the product of $m$
factors $(\nu(\psi_i) + 1)/2$ we will have $2^m$ terms, each of which is $2^{-m}$ times a product of $m$ or fewer factors of the
form $\nu(\psi_i)$. By the linear forms condition for $\nu$ we know that each such product is $1 + o(1)$, thus
the linear forms condition for $\nu_{1/2}$ follows (with a possibly different $o(1)$ term). The correlation
condition is checked similarly. The proof for $\nu_\alpha$ is similar. □

5. Gowers norms

One efficient tool in counting linear patterns is the Gowers norms. The Gowers norms were first
used by Gowers in his proof of Szemerédi's theorem [7], [8]. They have a counterpart in
ergodic theory, known as the Host-Kra seminorms [15].



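Definition 5 (Gowers norm). Let $G$ be a finite abelian group, $\phi$ a complex-valued function
defined on $G$. For $\omega = (\omega_1, \dots, \omega_d) \in \{0, 1\}^d$, let $|\omega| = \omega_1 + \dots + \omega_d$. Also, let $\mathcal{C}$ be the complex
conjugation. We define the $d$-th Gowers norm of $\phi$ to be

$$\|\phi\|_{U^d(G)} = \Big( \mathbb{E}_{x, h_1, \dots, h_d \in G} \prod_{\omega \in \{0,1\}^d} \mathcal{C}^{|\omega|} \phi(x + \omega_1 h_1 + \dots + \omega_d h_d) \Big)^{1/2^d}.$$

Alternatively, the Gowers norms $\|\cdot\|_{U^d(G)}$ can be defined recursively as follows:

$$\|\phi\|_{U^1(G)} = |\mathbb{E}(\phi \mid G)|, \qquad \|\phi\|_{U^{d+1}(G)}^{2^{d+1}} = \mathbb{E}\big( \|\phi \cdot \overline{\phi_t}\|_{U^d(G)}^{2^d} \mid t \in G \big),$$

where $\phi_t$ is the function $\phi_t(x) = \phi(t + x)$.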
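
For intuition, the definition can be evaluated directly on a small group. The Python sketch below computes $\|\phi\|_{U^d}$ by brute force for a real-valued function on the cyclic group $\mathbb{Z}_n$; the choice of a cyclic group (rather than $\mathbb{F}_{q^N}$), the restriction to real values (so no conjugation is needed), and the function name are assumptions made for illustration.

```python
from itertools import product

def gowers_norm(phi, d):
    """U^d norm of a real-valued function phi on Z_n, given as a list of length n."""
    n = len(phi)
    total = 0.0
    for x in range(n):
        for hs in product(range(n), repeat=d):
            term = 1.0
            # One factor for each vertex omega of the d-dimensional cube.
            for omega in product((0, 1), repeat=d):
                shift = sum(w * h for w, h in zip(omega, hs))
                term *= phi[(x + shift) % n]
            total += term
    mean = total / n ** (d + 1)
    # The mean is non-negative in exact arithmetic; abs() guards against rounding.
    return abs(mean) ** (1.0 / 2 ** d)

phi = [1.0, 0.0, 2.0, 0.0, 1.0]
norms = [gowers_norm(phi, d) for d in (1, 2, 3)]
```

The cost grows like $n^{d+1} 2^d$, so this is only practical for very small $n$ and $d$. The list `norms` should be non-decreasing, with first entry $|\mathbb{E}\phi|$.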

The following facts about the Gowers norms are standard and the proofs can be found in [13 
or 



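Proposition 3 (Gowers-Cauchy-Schwarz inequality). Suppose $\phi_\omega$, for $\omega \in \{0, 1\}^d$, are $2^d$
functions $G \to \mathbb{C}$. Then

$$\Big| \mathbb{E}\Big( \prod_{\omega \in \{0,1\}^d} \phi_\omega(x + \omega_1 h_1 + \dots + \omega_d h_d) \,\Big|\, x, h_1, \dots, h_d \in G \Big) \Big| \le \prod_{\omega \in \{0,1\}^d} \|\phi_\omega\|_{U^d(G)}.$$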
Proposition 4. For every $\phi$, the sequence $\|\phi\|_{U^d(G)}$, $d = 1, 2, \dots$ is an increasing sequence.
In particular for every $d \ge 1$, $\|\phi\|_{U^d(G)} \ge \|\phi\|_{U^1(G)} = |\mathbb{E}(\phi \mid G)|$.

Proposition 5. For every $d \ge 2$, $\|\cdot\|_{U^d(G)}$ is indeed a norm on $\mathbb{C}^G$, the space of complex
functions on $G$.

Henceforth, if the context is clear, we will assume $G = \mathbb{F}_{q^N}$ and omit the group in the notation
of the Gowers norm. In practice we will be dealing with the $U^{K-1}$ norm, where $K = q^k$.

Our first observation is that a pseudorandom measure is close to the uniform measure in the
$U^{K-1}$ norm.

Lemma 2. Let $\nu$ be a pseudorandom measure on $\mathbb{F}_{q^N}$. Then $\|\nu - 1\|_{U^{K-1}} = o(1)$. Consequently,
$\|\nu\|_{U^{K-1}} = 1 + o(1)$.

Proof. The proof is exactly the same as in Lemma 5.2 in [13], so we reproduce it briefly here
for the case $K = 3$, i.e. $\|\nu - 1\|_{U^2} = o(1)$. Equivalently, we have to show

$$\|\nu - 1\|_{U^2}^4 = \mathbb{E}\big( (\nu(f) - 1)(\nu(f + h_1) - 1)(\nu(f + h_2) - 1)(\nu(f + h_1 + h_2) - 1) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big) = o(1).$$

If we expand the expectation, we will have a sum of 16 terms, each term being, up to sign, the expectation
of a product of $\nu$ composed with linear forms. By the linear forms condition, each such expectation is
$1 + o(1)$. Thus our expression is $\sum_{A \subset \{0,1\}^2} (-1)^{|A|}(1 + o(1)) = o(1)$, as required. □

As mentioned before, Gowers norms are effective in counting linear patterns, as witnessed by 
the following 

Proposition 6 (Generalized von Neumann⁴). Suppose $\nu$ is pseudorandom. Let $(\phi_P)_{P \in G_k}$ be
functions bounded in absolute value by $\nu$. Then

$$\mathbb{E}\Big( \prod_{P \in G_k} \phi_P(f + Pg) \,\Big|\, f, g \in \mathbb{F}_{q^N} \Big) \le \min_P \|\phi_P\|_{U^{K-1}} + o_{N\to\infty}(1).$$

Before proving this proposition let us derive the following

Corollary 8.1. If the $\phi_P$ are bounded by $3 + \nu$ then $\mathbb{E}\big( \prod_{P \in G_k} \phi_P(f + Pg) \mid f, g \in \mathbb{F}_{q^N} \big) \le
4^K \min_P \|\phi_P\|_{U^{K-1}} + o_{N\to\infty}(1)$.

Proof. Just divide each $\phi_P$ by 4, and apply Proposition 6 for functions bounded by the measure
$(\nu + 3)/4$, which is pseudorandom according to Lemma 1. □

Proof of Proposition 6. The proof is exactly the same as in Proposition 5.3 in [13], so we just
reproduce it briefly here. Since $G_k$ is a subgroup of $(\mathbb{F}_{q^N}, +)$, by a change of variable if need
be, it suffices to show that $\mathbb{E}\big( \prod_{P \in G_k} \phi_P(f + Pg) \mid f, g \in G_N \big) \le \|\phi_0\|_{U^{K-1}} + o_{N\to\infty}(1)$.

⁴ Green and Tao call this type of inequality a generalized von Neumann theorem to emphasize its connection
with the classical von Neumann theorem in ergodic theory.
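We claim something slightly more general, namely that for any $m$ distinct non-zero polynomials
$P_1, P_2, \dots, P_m \in G_k$, we have

$$\mathbb{E}\big( \phi_0(f)\phi_1(f + P_1 g) \cdots \phi_m(f + P_m g) \mid f, g \in \mathbb{F}_{q^N} \big) \le \|\phi_0\|_{U^m} + o_{N\to\infty}(1).$$

Proposition 6 is a special case of this when $m = K - 1$. For $y = (y_1, \dots, y_m) \in (\mathbb{F}_{q^N})^m$ we
define the linear forms

$$\Psi_0(y) = y_1 + y_2 + \dots + y_m, \qquad \Psi_i(y) = \sum_{j=1}^{m} \Big(1 - \frac{P_i}{P_j}\Big) y_j \quad (i = 1, \dots, m).$$

Thus for every $i = 1, \dots, m$, $\Psi_i(y_1, \dots, y_m)$ does not depend on $y_i$, which will be crucial in
our later use of Cauchy-Schwarz. If $f = \sum_{i=1}^m y_i$ and $g = -\sum_{i=1}^m \frac{y_i}{P_i}$, then $f + P_i g = \Psi_i(y)$. It is
easy to see that the map $y \mapsto (f, g)$ is $q^{N(m-2)}$-to-one, so that

$$\mathbb{E}\Big( \prod_{i=0}^{m} \phi_i(f + P_i g) \,\Big|\, f, g \in \mathbb{F}_{q^N} \Big) = \mathbb{E}\Big( \prod_{i=0}^{m} \phi_i(\Psi_i(y)) \,\Big|\, y \in (\mathbb{F}_{q^N})^m \Big), \qquad \text{where } P_0 = 0.$$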
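
The change of variables can be sanity-checked numerically. The Python sketch below verifies $f + P_i g = \Psi_i(y)$, and that $\Psi_i$ ignores $y_i$, in a prime field $\mathbb{F}_p$ used as a stand-in for $\mathbb{F}_{q^N}$; the modulus 101, the sample values and the function names are all assumptions made for illustration.

```python
P_MOD = 101  # assumption: a prime field stands in for the field F_{q^N}

def inv(a):
    """Inverse modulo the prime P_MOD (Fermat's little theorem)."""
    return pow(a, P_MOD - 2, P_MOD)

def psi(i, P, y):
    """Psi_i(y) = sum_j (1 - P_i / P_j) y_j, for i = 0, ..., m-1 (0-based)."""
    return sum((1 - P[i] * inv(P[j])) * y[j] for j in range(len(y))) % P_MOD

def f_and_g(P, y):
    """f = sum_i y_i and g = -sum_i y_i / P_i."""
    f = sum(y) % P_MOD
    g = (-sum(yj * inv(Pj) for yj, Pj in zip(y, P))) % P_MOD
    return f, g

P = [3, 7, 12]          # distinct non-zero "polynomials" P_1, P_2, P_3
y = [5, 20, 77]
f, g = f_and_g(P, y)
```

The coefficient of $y_i$ in $\Psi_i$ is $1 - P_i/P_i = 0$, which is what allows the Cauchy-Schwarz steps below to eliminate one function at a time.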

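For simplicity let us treat the case $m = 2$. The argument then extends straightforwardly to
the general case. In this case $\Psi_1$ depends only on $y_2$ and $\Psi_2$ depends only on $y_1$. Let us call the left hand side $J_0$, that is,
$J_0 = \mathbb{E}\big( \phi_0(\Psi_0(y_1, y_2))\phi_1(\Psi_1(y_2))\phi_2(\Psi_2(y_1)) \mid y_1, y_2 \in \mathbb{F}_{q^N} \big)$. Since $\phi_2$ is bounded by $\nu$, we have:

$$J_0 \le \mathbb{E}\Big( \nu(\Psi_2(y_1)) \, \Big| \mathbb{E}\big( \phi_1(\Psi_1(y_2))\phi_0(\Psi_0(y_1, y_2)) \mid y_2 \in \mathbb{F}_{q^N} \big) \Big| \;\Big|\; y_1 \in \mathbb{F}_{q^N} \Big).$$

By Cauchy-Schwarz,

$$J_0^2 \le \mathbb{E}\big( \nu(\Psi_2(y_1)) \mid y_1 \in \mathbb{F}_{q^N} \big) \, \mathbb{E}\Big( \nu(\Psi_2(y_1)) \, \mathbb{E}\big( \phi_1(\Psi_1(y_2))\phi_0(\Psi_0(y_1, y_2)) \mid y_2 \in \mathbb{F}_{q^N} \big)^2 \;\Big|\; y_1 \in \mathbb{F}_{q^N} \Big) \le (1 + o(1)) J_1,$$

where we have used the linear forms condition for $\Psi_2(y_1)$, and

$$\begin{aligned}
J_1 &= \mathbb{E}\Big( \mathbb{E}\big( \phi_1(\Psi_1(y_2))\phi_0(\Psi_0(y_1, y_2)) \mid y_2 \in \mathbb{F}_{q^N} \big)^2 \, \nu(\Psi_2(y_1)) \;\Big|\; y_1 \in \mathbb{F}_{q^N} \Big) \\
&= \mathbb{E}\big( \phi_1(\Psi_1(y_2))\phi_0(\Psi_0(y_1, y_2))\phi_1(\Psi_1(y_2'))\phi_0(\Psi_0(y_1, y_2'))\nu(\Psi_2(y_1)) \mid y_1, y_2, y_2' \in \mathbb{F}_{q^N} \big) \\
&= \mathbb{E}\Big( \phi_1(\Psi_1(y_2))\phi_1(\Psi_1(y_2')) \, \mathbb{E}\big( \phi_0(\Psi_0(y_1, y_2))\phi_0(\Psi_0(y_1, y_2'))\nu(\Psi_2(y_1)) \mid y_1 \in \mathbb{F}_{q^N} \big) \;\Big|\; y_2, y_2' \in \mathbb{F}_{q^N} \Big) \\
&\le \mathbb{E}\Big( \nu(\Psi_1(y_2))\nu(\Psi_1(y_2')) \, \Big| \mathbb{E}\big( \phi_0(\Psi_0(y_1, y_2))\phi_0(\Psi_0(y_1, y_2'))\nu(\Psi_2(y_1)) \mid y_1 \in \mathbb{F}_{q^N} \big) \Big| \;\Big|\; y_2, y_2' \in \mathbb{F}_{q^N} \Big).
\end{aligned}$$

Note that we have eliminated $\phi_2$ in $J_1$. Next, again by Cauchy-Schwarz,

$$\begin{aligned}
J_1^2 &\le \mathbb{E}\big( \nu(\Psi_1(y_2))\nu(\Psi_1(y_2')) \mid y_2, y_2' \in \mathbb{F}_{q^N} \big) \times \\
&\quad \times \mathbb{E}\Big( \mathbb{E}\big( \phi_0(\Psi_0(y_1, y_2))\phi_0(\Psi_0(y_1, y_2'))\nu(\Psi_2(y_1)) \mid y_1 \in \mathbb{F}_{q^N} \big)^2 \, \nu(\Psi_1(y_2))\nu(\Psi_1(y_2')) \;\Big|\; y_2, y_2' \in \mathbb{F}_{q^N} \Big) \\
&\le (1 + o(1)) J_2,
\end{aligned}$$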
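where we have used the linear forms condition for the forms $\Psi_1(y_2)$ and $\Psi_1(y_2')$, and

$$\begin{aligned}
J_2 &= \mathbb{E}\Big( \mathbb{E}\big( \phi_0(\Psi_0(y_1, y_2))\phi_0(\Psi_0(y_1, y_2'))\nu(\Psi_2(y_1)) \mid y_1 \in \mathbb{F}_{q^N} \big)^2 \, \nu(\Psi_1(y_2))\nu(\Psi_1(y_2')) \;\Big|\; y_2, y_2' \in \mathbb{F}_{q^N} \Big) \\
&= \mathbb{E}\big( \phi_0(\Psi_0(y_1, y_2))\phi_0(\Psi_0(y_1, y_2'))\phi_0(\Psi_0(y_1', y_2))\phi_0(\Psi_0(y_1', y_2')) \times \\
&\qquad \times \nu(\Psi_2(y_1))\nu(\Psi_2(y_1'))\nu(\Psi_1(y_2))\nu(\Psi_1(y_2')) \mid y_1, y_1', y_2, y_2' \in \mathbb{F}_{q^N} \big).
\end{aligned}$$

Note that we have eliminated $\phi_1$ in $J_2$. Recall that $\Psi_0(y_1, y_2) = y_1 + y_2$. Let us re-parameterize
the cube $\{\Psi_0(y_1, y_2), \Psi_0(y_1', y_2), \Psi_0(y_1, y_2'), \Psi_0(y_1', y_2')\} = \{f, f + h_1, f + h_2, f + h_1 + h_2\}$. Then

$$\begin{aligned}
J_2 = \mathbb{E}\big( &\phi_0(f)\phi_0(f + h_1)\phi_0(f + h_2)\phi_0(f + h_1 + h_2) \times \\
&\times \nu(\Psi_1(f))\nu(\Psi_1(f + h_1))\nu(\Psi_2(f + h_2))\nu(\Psi_2(f + h_1 + h_2)) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big).
\end{aligned}$$

If it were not for the factor $W(f, h_1, h_2) = \nu(\Psi_1(f))\nu(\Psi_1(f + h_1))\nu(\Psi_2(f + h_2))\nu(\Psi_2(f + h_1 + h_2))$,
then $J_2$ would be equal to $\|\phi_0\|_{U^2}^4$. We have to show that $J_2 = \|\phi_0\|_{U^2}^4 + o(1)$.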

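Indeed,

$$\begin{aligned}
J_2 - \|\phi_0\|_{U^2}^4 &= \mathbb{E}\big( (W(f, h_1, h_2) - 1)\phi_0(f)\phi_0(f + h_1)\phi_0(f + h_2)\phi_0(f + h_1 + h_2) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big) \\
&\le \mathbb{E}\big( |W(f, h_1, h_2) - 1| \, \nu(f)\nu(f + h_1)\nu(f + h_2)\nu(f + h_1 + h_2) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big).
\end{aligned}$$

By Cauchy-Schwarz,

$$\begin{aligned}
\big| J_2 - \|\phi_0\|_{U^2}^4 \big|^2 &\le \mathbb{E}\big( \nu(f)\nu(f + h_1)\nu(f + h_2)\nu(f + h_1 + h_2) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big) \times \\
&\quad \times \mathbb{E}\big( (W(f, h_1, h_2) - 1)^2 \nu(f)\nu(f + h_1)\nu(f + h_2)\nu(f + h_1 + h_2) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big).
\end{aligned}$$

Thus it suffices to show the following two claims:

Claim 1. $\mathbb{E}\big( \nu(f)\nu(f + h_1)\nu(f + h_2)\nu(f + h_1 + h_2) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big) = 1 + o(1)$.

Claim 2. $\mathbb{E}\big( (W(f, h_1, h_2) - 1)^2 \nu(f)\nu(f + h_1)\nu(f + h_2)\nu(f + h_1 + h_2) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big) = o(1)$.

The first claim follows from the linear forms condition for 4 forms. As for the second claim,
we expand the left hand side as

$$\begin{aligned}
&\mathbb{E}\big( W(f, h_1, h_2)^2 \nu(f)\nu(f + h_1)\nu(f + h_2)\nu(f + h_1 + h_2) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big) \\
&\quad - 2\,\mathbb{E}\big( W(f, h_1, h_2) \nu(f)\nu(f + h_1)\nu(f + h_2)\nu(f + h_1 + h_2) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big) \\
&\quad + \mathbb{E}\big( \nu(f)\nu(f + h_1)\nu(f + h_2)\nu(f + h_1 + h_2) \mid f, h_1, h_2 \in \mathbb{F}_{q^N} \big).
\end{aligned}$$

Using the linear forms condition for 12, 8 and 4 forms respectively, we see that this is $(1 +
o(1)) - 2(1 + o(1)) + (1 + o(1)) = o(1)$, as required. □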

Remark 5.1. In the general case, we will need the linear forms condition for $K2^{K-1}$ linear
forms in $3K - 4$ variables.

Remark 5.2. Our use of this generalized von Neumann inequality follows Green-Tao and thus
is genuinely different from Gowers'. In his proof of Szemerédi's theorem, Gowers used the
following fact, which is now known as a weak form of the Gowers Inverse Conjecture [12]:
If the Gowers norm of a function is large, then it must correlate locally with a polynomial
phase. Green and Tao used this inequality for their transference principle, namely to transfer
Szemerédi's theorem from the uniform measure to pseudorandom measures.

6. Gowers anti-uniformity

Definition 6. For a real function $\phi$ on $\mathbb{F}_{q^N}$, define its $U^d$ dual function $\mathcal{D}_d\phi$ by

$$\mathcal{D}_d\phi(x) = \mathbb{E}\Big( \prod_{\omega \in \{0,1\}^d,\ \omega \neq 0} \phi(x + \omega_1 h_1 + \dots + \omega_d h_d) \,\Big|\, h_1, \dots, h_d \in \mathbb{F}_{q^N} \Big).$$

From the definitions of the Gowers $U^d$ norm and dual functions it follows immediately
that

Lemma 3. $\langle \phi, \mathcal{D}_d\phi \rangle = \|\phi\|_{U^d}^{2^d}$.

From a functional analytic point of view $\mathcal{D}_d\phi$ may be regarded as a "support functional" of
$\phi$, with the difference that $\mathcal{D}_d\phi$ is not linear in $\phi$. From now on we will be working with the $U^{K-1}$
dual functions. We will be particularly interested in the dual functions of functions bounded
by a pseudorandom measure $\nu$.
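
The identity in Lemma 3 can be checked numerically for $d = 2$. The Python sketch below works on the cyclic group $\mathbb{Z}_n$ instead of $\mathbb{F}_{q^N}$ and uses real-valued functions; both choices, as well as the function names, are assumptions made for illustration.

```python
def dual_u2(phi):
    """U^2 dual function on Z_n: average of phi(x+h1) phi(x+h2) phi(x+h1+h2)."""
    n = len(phi)
    return [
        sum(phi[(x + h1) % n] * phi[(x + h2) % n] * phi[(x + h1 + h2) % n]
            for h1 in range(n) for h2 in range(n)) / n ** 2
        for x in range(n)
    ]

def u2_fourth_power(phi):
    """The fourth power of the U^2 norm, computed directly from the definition."""
    n = len(phi)
    return sum(
        phi[x] * phi[(x + h1) % n] * phi[(x + h2) % n] * phi[(x + h1 + h2) % n]
        for x in range(n) for h1 in range(n) for h2 in range(n)
    ) / n ** 3

def inner(a, b):
    """Normalized inner product <a, b> = E(a(x) b(x))."""
    return sum(u * v for u, v in zip(a, b)) / len(a)

phi = [1.0, 0.0, 2.0, 0.0, 1.0]
lhs = inner(phi, dual_u2(phi))
rhs = u2_fourth_power(phi)
```

Note that `dual_u2` is cubic in `phi`, which illustrates the remark that the dual function does not depend linearly on $\phi$.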

Lemma 4. If $0 \le \phi \le \nu$, then $\langle \phi, \mathcal{D}_{K-1}\phi \rangle \le 1 + o(1)$.

Proof. This follows from Lemmas 2 and 3. □

Lemma 5. For every $m$ there is a constant $C(m)$ such that if $0 \le \phi_1, \dots, \phi_m \le \nu$, then
$\|\mathcal{D}_{K-1}\phi_1 \cdots \mathcal{D}_{K-1}\phi_m\|^*_{U^{K-1}} \le C(m)$, where $\|\cdot\|^*_{U^{K-1}}$ is the dual norm of $\|\cdot\|_{U^{K-1}}$ (defined in
the usual way: $\|f\|^*_{U^{K-1}} = \sup\{|\langle f, g \rangle| : \|g\|_{U^{K-1}} \le 1\}$).

This is by far the most important property of the dual functions, and is perhaps surprising, since
$m$ is not bounded, while the number of forms in the linear forms condition and correlation
condition is bounded. However, this comes from the fact that the exponent $p$ of the function
$\tau$ in the correlation condition is not bounded. In the language of Reingold-Trevisan-Tulsiani-Vadhan
this means that $\nu$ is indistinguishable from the uniform measure according to the family
$\{\mathcal{D}_{K-1}\phi_1 \cdots \mathcal{D}_{K-1}\phi_m : 0 \le \phi_j \le \nu\}$.

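Proof. The proof is exactly the same as in Lemma 6.3 in [13], so we will reproduce it here for
the case $K = 3$, $m = 2$. It suffices to show that for any function $\psi$ with $\|\psi\|_{U^2} \le 1$, we have
$\langle \psi, \mathcal{D}\phi_1 \mathcal{D}\phi_2 \rangle = O(1)$. We write this out as

$$\begin{aligned}
&\mathbb{E}\Big( \psi(f) \, \mathbb{E}\big( \phi_1(f + h_1)\phi_1(f + h_2)\phi_1(f + h_1 + h_2) \mid h_1, h_2 \in \mathbb{F}_{q^N} \big) \times \\
&\qquad \times \mathbb{E}\big( \phi_2(f + k_1)\phi_2(f + k_2)\phi_2(f + k_1 + k_2) \mid k_1, k_2 \in \mathbb{F}_{q^N} \big) \;\Big|\; f \in \mathbb{F}_{q^N} \Big) \\
&= \mathbb{E}\Big( \psi(f) \, \mathbb{E}\big( \phi_1(f + h_1 + g_1)\phi_1(f + h_2 + g_2)\phi_1(f + h_1 + g_1 + h_2 + g_2) \mid h_1, h_2 \in \mathbb{F}_{q^N} \big) \times \\
&\qquad \times \mathbb{E}\big( \phi_2(f + k_1 + g_1)\phi_2(f + k_2 + g_2)\phi_2(f + k_1 + g_1 + k_2 + g_2) \mid k_1, k_2 \in \mathbb{F}_{q^N} \big) \;\Big|\; f, g_1, g_2 \in \mathbb{F}_{q^N} \Big).
\end{aligned}$$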
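We rewrite this as

$$\begin{aligned}
\mathbb{E}\Big( \mathbb{E}\big( &\psi(f)\,\phi_1(f + g_1 + h_1)\phi_2(f + g_1 + k_1)\,\phi_1(f + g_2 + h_2)\phi_2(f + g_2 + k_2) \times \\
&\times \phi_1(f + g_1 + g_2 + h_1 + h_2)\phi_2(f + g_1 + g_2 + k_1 + k_2) \mid f, g_1, g_2 \in \mathbb{F}_{q^N} \big) \;\Big|\; h_1, h_2, k_1, k_2 \in \mathbb{F}_{q^N} \Big).
\end{aligned}$$

By the Gowers-Cauchy-Schwarz inequality this is at most

$$\mathbb{E}\Big( \|\psi\|_{U^2} \, \|\phi_1(\cdot + h_1)\phi_2(\cdot + k_1)\|_{U^2} \, \|\phi_1(\cdot + h_2)\phi_2(\cdot + k_2)\|_{U^2} \, \|\phi_1(\cdot + h_1 + h_2)\phi_2(\cdot + k_1 + k_2)\|_{U^2} \;\Big|\; h_1, h_2, k_1, k_2 \in \mathbb{F}_{q^N} \Big).$$

By Hölder's inequality, and since $\|\psi\|_{U^2} \le 1$, this is at most

$$\mathbb{E}\big( \|\phi_1(\cdot + h)\phi_2(\cdot + k)\|_{U^2}^3 \mid h, k \in \mathbb{F}_{q^N} \big) \le \mathbb{E}\big( \|\phi_1(\cdot + h)\phi_2(\cdot + k)\|_{U^2}^4 \mid h, k \in \mathbb{F}_{q^N} \big)^{3/4}.$$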



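Thus it suffices to show $\mathbb{E}\big( \|\phi_1(\cdot + h)\phi_2(\cdot + k)\|_{U^2}^4 \mid h, k \in \mathbb{F}_{q^N} \big) = O(1)$. If we expand this out
then it is equal to

$$\begin{aligned}
\mathbb{E}\Big( \mathbb{E}\big( &\phi_1(f + h)\phi_2(f + k)\,\phi_1(f + h + g_1)\phi_2(f + k + g_1)\,\phi_1(f + h + g_2)\phi_2(f + k + g_2) \times \\
&\times \phi_1(f + h + g_1 + g_2)\phi_2(f + k + g_1 + g_2) \mid f, g_1, g_2 \in \mathbb{F}_{q^N} \big) \;\Big|\; h, k \in \mathbb{F}_{q^N} \Big).
\end{aligned}$$

If we interchange the order of summation then this is equal to

$$\begin{aligned}
&\mathbb{E}\Big( \mathbb{E}\big( \phi_1(f + h)\phi_2(f + k)\,\phi_1(f + h + g_1)\phi_2(f + k + g_1)\,\phi_1(f + h + g_2)\phi_2(f + k + g_2) \times \\
&\qquad\quad \times \phi_1(f + h + g_1 + g_2)\phi_2(f + k + g_1 + g_2) \mid f, h, k \in \mathbb{F}_{q^N} \big) \;\Big|\; g_1, g_2 \in \mathbb{F}_{q^N} \Big) \\
&= \mathbb{E}\Big( \mathbb{E}\big( \phi_1(h)\phi_2(k)\,\phi_1(h + g_1)\phi_2(k + g_1)\,\phi_1(h + g_2)\phi_2(k + g_2) \times \\
&\qquad\quad \times \phi_1(h + g_1 + g_2)\phi_2(k + g_1 + g_2) \mid h, k \in \mathbb{F}_{q^N} \big) \;\Big|\; g_1, g_2 \in \mathbb{F}_{q^N} \Big) \\
&\le \mathbb{E}\Big( \mathbb{E}\big( \nu(f)\nu(f + g_1)\nu(f + g_2)\nu(f + g_1 + g_2) \mid f \in \mathbb{F}_{q^N} \big)^2 \;\Big|\; g_1, g_2 \in \mathbb{F}_{q^N} \Big).
\end{aligned}$$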



According to the correlation condition, and the triangle inequality, this is at most 



E ( {r{gi) + T{g2) + T{gi - 52) + T{gi + 52))' 



51,52 G F 



< ('E(r(5i)2|5i,52 GF„iv)i/2+E(r(52)'|5i,52 GF„^; 



1/2 



+E(T(gi - 52)^151,52 G F„iv)^/2 ^ E(r(5i +52)^151,52 G F„^; 



1/2 



4E(t(5)'|5 G F, 



0(1) 



as required. 

Remark 6.1. In the general case, we will need the correlation condition for 2^~^ forms. 



□ 



7. A DECOMPOSITION AND A TRANSFERENCE PRINCIPLE 



In this section we reproduce Gowers' proof of the Green-Tao-Ziegler theorem and use the latter to derive Theorem 3. The reader is nevertheless encouraged to read the original paper for a survey of the interplay between decomposition results and the use of the Hahn-Banach theorem in arithmetic combinatorics.

We first forget for a moment the definitions of Gowers norms and dual functions, and instead axiomatize their properties as proved in the preceding lemmas (in particular Lemma 5). Consider a finite set $G$ and let $\mathbf{R}^G$ be the set of all real functions on $G$ with the inner product $\langle f, g\rangle = \mathbf{E}_{x \in G} f(x)g(x)$.

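Definition 7. We say that a norm $\|\cdot\|$ on $\mathbf{R}^G$ is a quasi-algebra predual norm with respect to a convex, compact set $\mathcal{F} \subset \mathbf{R}^G$ if there is a function $c : \mathbf{R}^+ \to \mathbf{R}^+$, a function $C : \mathbf{Z}^+ \to \mathbf{R}^+$, and an operator $\mathcal{D} : \mathbf{R}^G \to \mathbf{R}^G$ such that the following hold:

(1) $\langle f, \mathcal{D}f\rangle \le 1$ for every $f \in \mathcal{F}$.

(2) $\langle f, \mathcal{D}f\rangle \ge c(\epsilon)$ for every $f \in \mathcal{F}$ with $\|f\| \ge \epsilon$.

(3) $\|\mathcal{D}f_1 \cdots \mathcal{D}f_m\|^* \le C(m)$ for every $f_1, \ldots, f_m \in \mathcal{F}$, where $\|\cdot\|^*$ is the dual norm of $\|\cdot\|$.

(4) The set $\{\mathcal{D}f : f \in \mathcal{F}\}$ is compact and spans $\mathbf{R}^G$.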

The reason why $\|\cdot\|$ is called a quasi-algebra predual norm is that the dual norm $\|\cdot\|^*$ is "close" to being an algebra norm (this will be made precise in Lemma 7). The application we have in mind is when $G = \mathbf{F}_{q^N}$, $\|\cdot\|$ is the (normalized) $U^{K-1}$ Gowers norm, the $\mathcal{D}f$ are the (normalized) $U^{K-1}$ dual functions, and $\mathcal{F}$ is the space of nonnegative functions bounded by a pseudorandom measure $\nu$.
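As a small illustration of the objects being axiomatized, the following sketch computes the $U^2$ norm and the $U^2$ dual function on the group $\mathbf{F}_2^n$ (elements encoded as integers, addition as XOR) and checks the identity $\langle f, \mathcal{D}f\rangle = \|f\|_{U^2}^4$ that underlies properties (1) and (2). The function names are hypothetical, and the unnormalized $U^2$ definitions used here are an assumption for illustration; they are not the normalization used in the paper.

```python
from fractions import Fraction


def u2_norm_fourth_power(f):
    """||f||_{U^2}^4 = E_{x,h1,h2} f(x) f(x+h1) f(x+h2) f(x+h1+h2) on F_2^n.

    f is a list of length 2^n; the group law is XOR on the indices.
    """
    size = len(f)
    total = sum(
        f[x] * f[x ^ h1] * f[x ^ h2] * f[x ^ h1 ^ h2]
        for x in range(size)
        for h1 in range(size)
        for h2 in range(size)
    )
    return Fraction(total, size ** 3)


def dual_function(f):
    """Df(x) = E_{h1,h2} f(x+h1) f(x+h2) f(x+h1+h2)."""
    size = len(f)
    return [
        Fraction(
            sum(f[x ^ h1] * f[x ^ h2] * f[x ^ h1 ^ h2]
                for h1 in range(size) for h2 in range(size)),
            size ** 2,
        )
        for x in range(size)
    ]


def inner_product(f, g):
    """<f, g> = E_x f(x) g(x)."""
    return sum(Fraction(a) * b for a, b in zip(f, g)) / len(f)
```

For any integer-valued `f` of length $2^n$, `inner_product(f, dual_function(f))` agrees exactly with `u2_norm_fourth_power(f)`, since both expand to the same average over parallelograms.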

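Associated to the norm $\|\cdot\|$, we will also consider the norm $\|g\|_{BAC} = \max\{|\langle g, \mathcal{D}f\rangle| : f \in \mathcal{F}\}$ and its dual $\|\cdot\|^*_{BAC}$. (Since the set $\{\mathcal{D}f : f \in \mathcal{F}\}$ is compact and spans $\mathbf{R}^G$, $\|\cdot\|_{BAC}$ is indeed a norm.) Here BAC stands for Basic Anti-uniform Correlation. Thus $\|\cdot\|$ and $\|\cdot\|_{BAC}$ are equivalent in the sense that if $f \in \mathcal{F}$ and $\|f\| \ge \epsilon$ then $\|f\|_{BAC} \ge c(\epsilon)$.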

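The following gives a simple characterization of the $\|\cdot\|^*_{BAC}$ norm.

Lemma 6. $\|f\|^*_{BAC} = \inf\big\{\sum_{i=1}^k |\lambda_i| : f = \sum_{i=1}^k \lambda_i \mathcal{D}f_i,\ f_1, \ldots, f_k \in \mathcal{F}\big\}$.

Proof. This can be proven using Farkas' lemma [22, Section 1.16] (which is another incarnation of the Hahn-Banach theorem). We can also do this in a relatively simpler way as follows. Define the norm $\|f\|_0 = \inf\big\{\sum_{i=1}^k |\lambda_i| : f = \sum_{i=1}^k \lambda_i \mathcal{D}f_i,\ f_1, \ldots, f_k \in \mathcal{F}\big\}$ (which exists by our assumption that the $\mathcal{D}f$, $f \in \mathcal{F}$, span $\mathbf{R}^G$). We have to show that the dual norm $\|\cdot\|^*_0$ is equal to $\|\cdot\|_{BAC}$ (note that here we are using Hahn-Banach implicitly!).

Suppose $g \in \mathbf{R}^G$. For any decomposition $f = \sum_{i=1}^k \lambda_i \mathcal{D}f_i$, $f_1, \ldots, f_k \in \mathcal{F}$, we have $|\langle g, f\rangle| = \big|\sum_{i=1}^k \lambda_i \langle g, \mathcal{D}f_i\rangle\big| \le \sum_{i=1}^k |\lambda_i| \sup\{|\langle g, \mathcal{D}f\rangle| : f \in \mathcal{F}\}$. Thus $|\langle g, f\rangle| \le \|f\|_0 \|g\|_{BAC}$ for every $f$, so that $\|g\|^*_0 \le \|g\|_{BAC}$.

For the other direction, suppose $\|g\|_{BAC} = 1$. Then for every $\epsilon > 0$, there exists $f \in \mathcal{F}$ such that $|\langle g, \mathcal{D}f\rangle| \ge 1 - \epsilon$. Note that $\|\mathcal{D}f\|_0 \le 1$. Thus $\|g\|^*_0 \ge 1 - \epsilon$ for any $\epsilon > 0$. This shows that $\|g\|^*_0 \ge \|g\|_{BAC}$ for any $g \in \mathbf{R}^G$. Therefore, $\|\cdot\|_{BAC} = \|\cdot\|^*_0$. □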

We now see that the name "quasi-algebra predual" is justified by the following:

Lemma 7. If $\psi \in \mathbf{R}^G$ is such that $\|\psi\|^*_{BAC} \le 1$, then $\|\psi^m\|^* \le C(m)$.

Proof. If $\|\psi\|^*_{BAC} \le 1$, then for every $\epsilon > 0$, $\psi$ can be written as a linear combination of functions $\mathcal{D}f$, $f \in \mathcal{F}$, with the absolute values of the coefficients adding up to less than $1 + \epsilon$. Thus $\psi^m$ can be written as a linear combination of products of $m$ functions from $\{\mathcal{D}f : f \in \mathcal{F}\}$, with the absolute values of the coefficients adding up to less than $(1 + \epsilon)^m$. Since the $\|\cdot\|^*$ norm of every product of $m$ functions from $\{\mathcal{D}f : f \in \mathcal{F}\}$ is at most $C(m)$, we have $\|\psi^m\|^* \le (1+\epsilon)^m C(m)$ for every $\epsilon > 0$, and we conclude that $\|\psi^m\|^* \le C(m)$. □

Specializing to the case where $\mathcal{F}$ is the set of all nonnegative functions bounded by a function $\nu \in \mathbf{R}^G$, we claim that any function from $\mathcal{F}$ can be written as the sum of a bounded function and another function small under $\|\cdot\|$. This is the content of the Green-Tao-Ziegler structure theorem.

Theorem 9 (Green-Tao-Ziegler structure theorem, [13]). For every $\eta > 0$, there is $\epsilon = \epsilon(\eta, C, c) > 0$ such that the following holds: Let $\nu$ be a measure on $G$ such that $\|\nu - 1\| < \epsilon$, $\mathbf{E}_G(\nu) \le 1 + \eta$, and all properties in Definition 7 hold for $\mathcal{F} = \{f : 0 \le f \le \nu\}$. Then every function $f \in \mathcal{F}$ can be decomposed as $f = g + h$, where $0 \le g \le 1 + \eta$ and $\|h\| \le \eta$.

Proof. Suppose such a decomposition doesn't exist. Since $\|h\|_{BAC} < c(\eta)$ implies $\|h\| < \eta$, this implies that $f$ cannot be expressed as the sum of elements from the two convex sets $X_1 = \{g : 0 \le g \le 1 + \eta\}$ and $X_2 = \{h : \|h\|_{BAC} \le c(\eta)\}$ in $\mathbf{R}^G$.

Claim 3. There is a function $\psi \in \mathbf{R}^G$ such that $\langle f, \psi\rangle > 1$ but $\langle g, \psi\rangle \le 1$ and $\langle h, \psi\rangle \le 1$ for every $g \in X_1$, $h \in X_2$.

Proof. Let $X = X_1 + X_2$; then $X$ is convex and closed. We invoke the following form of the Hahn-Banach theorem: if $f \notin X$, then there is a linear functional $\langle \cdot, \psi\rangle$ on $\mathbf{R}^G$ such that $\langle f, \psi\rangle > 1$ and $\langle g, \psi\rangle \le 1$ for every $g \in X$. Since $X_1$ and $X_2$ both contain 0, $X_1$ and $X_2$ are contained in $X$ and the claim follows. □

The condition $\langle g, \psi\rangle \le 1$ for every $g \in X_1$ implies that $\mathbf{E}_G \psi_+ \le \frac{1}{1+\eta}$, where $\psi_+(x) = \max(0, \psi(x))$. The condition $\langle h, \psi\rangle \le 1$ for every $h \in X_2$ implies that $\|\psi\|^*_{BAC} \le c(\eta)^{-1}$.

Claim 4. For any $\eta' > 0$, there is a polynomial $P = P(\eta, \eta', C, c)$ and a constant $R = R(\eta, \eta', C, c)$ such that $\|P\psi - \psi_+\|_\infty \le \eta'$ and $\|P\psi\|^* \le R$.

Proof. Since $\|\psi\|^*_{BAC} \le c(\eta)^{-1}$, by Lemma 7 we have $\|\psi\|^* \le C_1 = C(1)c(\eta)^{-1}$. By Weierstrass' approximation theorem, there is a polynomial $P(x) = a_n x^n + \cdots + a_0$ such that $|P(x) - \max(0, x)| \le \eta'$ for every $x \in [-C_1, C_1]$. Then clearly $\|P\psi - \psi_+\|_\infty \le \eta'$. Next we claim that $\|P\psi\|^*$ is bounded (independently of $\psi$). By the triangle inequality it suffices to show this for $\|\psi^m\|^*$ for each $m$. But this follows from Lemma 7. □

We now have $1 < \langle f, \psi\rangle \le \mathbf{E}_G f\psi_+ \le \mathbf{E}_G \nu\psi_+$. We split the latter as

$$\mathbf{E}_G \nu\psi_+ = \mathbf{E}_G \psi_+ + \mathbf{E}_G (\nu - 1)P\psi + \mathbf{E}_G (\nu - 1)(\psi_+ - P\psi).$$

Also, $|\mathbf{E}_G (\nu - 1)P\psi| \le \|\nu - 1\|\,\|P\psi\|^* \le \epsilon R$, and $|\mathbf{E}_G (\nu - 1)(\psi_+ - P\psi)| \le (\mathbf{E}_G \nu + 1)\|P\psi - \psi_+\|_\infty \le \eta'(2 + \eta)$. Thus $1 < \frac{1}{1+\eta} + \eta'(2 + \eta) + \epsilon R$. If we fix a small value of $\eta'$ (e.g. $\eta' = \eta/12$ will do), then this is a contradiction if $\epsilon$ is small enough. □

Let us now formulate the result in the setting of Gowers norms and pseudorandom measures on $\mathbf{F}_{q^N}$:

Corollary 9.1. Let $\nu$ be a pseudorandom measure on $\mathbf{F}_{q^N}$. Then for every $\eta > 0$, for $N$ sufficiently large, every function $\phi$ on $\mathbf{F}_{q^N}$ such that $0 \le \phi \le \nu$ can be decomposed as $\phi = \phi_1 + \phi_2$, where $0 \le \phi_1 \le 2 + \eta$ and $\phi_2$ is uniform in the sense that $\|\phi_2\|_{U^{K-1}} \le \eta$.



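Proof. If $0 \le \phi \le \nu$, then $0 \le \phi/2 \le \nu/2 \le \nu_{1/2} = \frac{\nu + 1}{2}$. We already know that $\nu_{1/2}$ is also pseudorandom. Let $G = \mathbf{F}_{q^N}$ and $\mathcal{F}$ be the space of all nonnegative functions bounded by $\nu_{1/2}$. Let us check that the normalized Gowers $U^{K-1}$ norm is quasi-algebra predual with respect to $\mathcal{F}$, where $\mathcal{D}$ is the correspondingly normalized dual-function operator $\mathcal{D}_{K-1}$. Thanks to Lemmas 3, 4 and 5, the first three conditions in Definition 7 are met. The only thing left to check is the fourth condition, i.e. that the set of dual functions $\mathcal{D}\phi$ spans $\mathbf{R}^G$. Note that if $\phi$ is a point mass, then $\mathcal{D}\phi$ is also a point mass (at the same point). Since $\nu_{1/2}$ is pointwise positive¹, $\{\mathcal{D}\phi : \phi \in \mathcal{F}\}$ contains masses at every point of $G$, hence spans $\mathbf{R}^G$.

By Theorem 9, there is $\epsilon = \epsilon(\eta) > 0$ such that we have a decomposition

$$\frac{\phi}{2} = \phi_1 + \phi_2, \quad \text{where } 0 \le \phi_1 \le 1 + \frac{\eta}{2} \text{ and } \|\phi_2\|_{U^{K-1}} \le \frac{\eta}{2},$$

as soon as $\mathbf{E}_G \nu_{1/2} \le 1 + \frac{\eta}{2}$ and $\|\nu_{1/2} - 1\| < \epsilon$. But this is always true for $N$ sufficiently large, since $\nu_{1/2}$ is a pseudorandom measure. Such a decomposition for $\phi/2$ gives the desired decomposition for $\phi$. □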

Remark 7.1. If instead of $\nu_{1/2}$ we consider $\nu_\alpha = (1 - \alpha)\nu + \alpha$, where $\alpha > 0$ is sufficiently small depending on $\eta$, we can actually show that there is a decomposition $\phi = \phi_1 + \phi_2$, where $0 \le \phi_1 \le 1 + \eta$ and $\|\phi_2\|_{U^{K-1}} \le \eta$, but this is not important.

With this in hand, we can now prove Theorem 3.



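Proof of Theorem 3 using the Green-Tao-Ziegler structure theorem. We know that for every $\eta > 0$, for $N$ sufficiently large (depending on $\eta$), every function $\phi$ bounded by a pseudorandom measure on $\mathbf{F}_{q^N}$ can be decomposed as $\phi = \phi_1 + \phi_2$, where $0 \le \phi_1 \le 2 + \eta$ and $\|\phi_2\|_{U^{K-1}} \le \eta$. In particular $|\mathbf{E}\phi_2| \le \eta$, so that if $\mathbf{E}\phi \ge \delta$, then $\mathbf{E}\phi_1 \ge \delta - \eta$. Write

$$\mathbf{E}\Big(\prod_{P \in G_k} \phi(f + Pg) \Bigm| f, g \in \mathbf{F}_{q^N}\Big) = \mathbf{E}\Big(\prod_{P \in G_k} \phi_1(f + Pg) \Bigm| f, g \in \mathbf{F}_{q^N}\Big) + (2^K - 1) \text{ other terms}.$$

The other terms are of the form $\mathbf{E}\big(\prod_{P \in G_k} \phi_P(f + Pg) \mid f, g \in \mathbf{F}_{q^N}\big)$, where each $\phi_P = \phi_1$ or $\phi_2$, and not all $\phi_P$ are equal to $\phi_1$.

Since $\phi_1$ is bounded pointwise by $2 + \eta$ and $\phi_2$ is bounded pointwise by $\max(\nu, 2 + \eta) \le 3 + \nu$ in absolute value, by the generalized von Neumann theorem proved earlier these terms are at most $4^K \|\phi_2\|_{U^{K-1}} + o(1)$ in absolute value.

On the other hand, by Szemerédi's theorem in function fields (stated earlier), applied to $\phi_1/(2+\eta)$, we have $\mathbf{E}\big(\prod_{P \in G_k} \phi_1(f + Pg) \mid f, g \in \mathbf{F}_{q^N}\big) \ge (2 + \eta)^K c\big(\frac{\delta - \eta}{2 + \eta}\big)$. Hence

$$\mathbf{E}\Big(\prod_{P \in G_k} \phi(f + Pg) \Bigm| f, g \in \mathbf{F}_{q^N}\Big) \ge (2 + \eta)^K c\Big(\frac{\delta - \eta}{2 + \eta}\Big) - (2^K - 1)4^K \eta - o(1).$$

By choosing $\eta$ appropriately small, the main term on the right hand side is positive, so that there is a positive constant $c'(\delta)$ such that $\mathbf{E}\big(\prod_{P \in G_k} \phi(f + Pg) \mid f, g \in \mathbf{F}_{q^N}\big) \ge c'(\delta) - o(1)$ for every function $\phi$ on $\mathbf{F}_{q^N}$ bounded by a pseudorandom measure with $\mathbf{E}\phi \ge \delta$. □

Remark 7.2. By running the argument carefully (e.g. by modifying $\phi_1$ so that it is bounded above by exactly 1, and its average is exactly $\delta$) we can show that actually $c'(\delta)$ can be taken to be $c(\delta)$. However, there is little point in doing so since we don't have an explicit value for $c(\delta)$.

¹ This is the sole reason why we work with $\nu_{1/2}$ rather than with $\nu$.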


8. Elementary arithmetic in $\mathbf{F}_q[t]$

In this section we will describe some basic arithmetic properties of $\mathbf{F}_q[t]$, introduce arithmetic functions on $\mathbf{F}_q[t]$ and prove some preliminary lemmas relevant to the construction of a pseudorandom measure in Section 9. We assume from now on that polynomials denoted by the letter $P$ (such as $P$, $P'$, or $P_i$) will stand for monic, irreducible polynomials.

The units of the ring $\mathbf{F}_q[t]$ are the elements of $\mathbf{F}_q \setminus \{0\}$. Similarly to the integers, $\mathbf{F}_q[t]$ is a unique factorization domain. More precisely, every nonzero $f \in \mathbf{F}_q[t]$ can be written uniquely as $f = cP_1^{\alpha_1} \cdots P_m^{\alpha_m}$, where $c \in \mathbf{F}_q$, $\alpha_i \in \mathbf{Z}^+$ and the $P_i$ are monic, irreducible polynomials. We can now introduce arithmetic functions on $\mathbf{F}_q[t]$:



• The Euler totient function $\Phi(f)$ is the number of polynomials of degree less than $\deg(f)$ which are relatively prime to $f$. We have the following formula for $\Phi(f)$ in terms of its prime factorization: $\Phi(f) = |f| \prod_{P \mid f} \big(1 - \frac{1}{|P|}\big) = \prod_{i=1}^m \big(|P_i|^{\alpha_i} - |P_i|^{\alpha_i - 1}\big)$.

• The Möbius function $\mu(f) = (-1)^m$ if $\alpha_i = 1$ for every $i = 1, \ldots, m$, and $\mu(f) = 0$ otherwise.

• The von Mangoldt function $\Lambda(f) = \deg(P_1)$ if $m = 1$, and $\Lambda(f) = 0$ otherwise.

• $d(f)$, the number of monic divisors of $f$. We have the following formula: $d(f) = \prod_{i=1}^m (\alpha_i + 1)$.

• For $d_1, \ldots, d_m \in \mathbf{F}_q[t]$, $d_i \ne 0$, denote by $[d_1, \ldots, d_m]$ the least common multiple of $d_1, \ldots, d_m$, in other words, the polynomial of smallest degree that is divisible by $d_i$ for every $i = 1, \ldots, m$ (which is defined up to multiplication by an element of $\mathbf{F}_q \setminus \{0\}$).
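The totient formula above can be checked by brute force in the smallest case $q = 2$. The sketch below is an illustration only: it assumes $q = 2$, encodes a polynomial in $\mathbf{F}_2[t]$ as an integer bitmask (bit $i$ is the coefficient of $t^i$), and all function names are hypothetical helpers, not notation from the paper.

```python
def gf2_divmod(a, b):
    """Quotient and remainder of a by b in F_2[t] (bit i = coefficient of t^i)."""
    quotient, width = 0, b.bit_length()
    while a.bit_length() >= width:
        shift = a.bit_length() - width
        quotient ^= 1 << shift
        a ^= b << shift
    return quotient, a


def gf2_gcd(a, b):
    """Euclidean algorithm in F_2[t]; the result is automatically monic."""
    while b:
        a, b = b, gf2_divmod(a, b)[1]
    return a


def totient_bruteforce(f):
    """Count polynomials of degree < deg(f) that are coprime to f."""
    degree = f.bit_length() - 1
    return sum(1 for g in range(1 << degree) if gf2_gcd(f, g) == 1)


def distinct_irreducible_factors(f):
    """Distinct irreducible factors of f, found by trial division."""
    factors, p = [], 2  # p = 2 encodes the polynomial t
    while f.bit_length() > 1:
        if 2 * p.bit_length() - 1 > f.bit_length():
            factors.append(f)  # deg p > deg(f)/2, so the cofactor is irreducible
            break
        if gf2_divmod(f, p)[1] == 0:
            factors.append(p)
            while gf2_divmod(f, p)[1] == 0:
                f = gf2_divmod(f, p)[0]
        p += 1
    return factors


def totient_formula(f):
    """|f| * prod_{P | f} (1 - 1/|P|), with |f| = 2^deg(f)."""
    value = 1 << (f.bit_length() - 1)
    for p in distinct_irreducible_factors(f):
        norm = 1 << (p.bit_length() - 1)
        value = value // norm * (norm - 1)
    return value
```

For instance, $f = t^2(t+1)$ is encoded as `0b1100`; both functions count the same set of polynomials of degree less than 3 coprime to $f$.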

The zeta function $\zeta_q$ of $\mathbf{F}_q[t]$ is defined by $\zeta_q(s) = \sum_{f \text{ monic}} \frac{1}{|f|^s}$ for any $s \in \mathbf{C}$ such that $\Re s > 1$. We have the following closed form for the zeta function: $\zeta_q(s) = \frac{1}{1 - q^{1-s}}$. Thus it can be analytically continued to the whole plane, with a simple pole at $s = 1$, at which the residue is $\frac{1}{\log q}$.

Similarly to the Riemann zeta function, $\zeta_q$ admits a factorization as an Euler product: $\zeta_q(s) = \prod_P \big(1 - \frac{1}{|P|^s}\big)^{-1}$.



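We have the following analog of the prime number theorem:

Proposition 7 (Prime number theorem for function fields). Let $\pi_q(N)$ be the number of irreducible polynomials of degree $N$ in $\mathbf{F}_q[t]$. Then $\pi_q(N) = (q - 1)\frac{q^N}{N} + O\big(\frac{q^{N/2}}{N}\big)$.

Corollary 9.2. $\sum_{\deg(P) \le N} \frac{1}{|P|} = \log N + O(1)$.

Proof. Since the number of monic irreducible polynomials of degree $n$ is $\pi_q(n)/(q-1)$, we have

$$\sum_{\deg(P) \le N} \frac{1}{|P|} = \sum_{n=1}^N \frac{1}{q^n}\Big(\frac{q^n}{n} + O\Big(\frac{q^{n/2}}{n}\Big)\Big) = \sum_{n=1}^N \frac{1}{n} + O\Big(\sum_{n=1}^N \frac{q^{-n/2}}{n}\Big) = \log N + O(1).$$

□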
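Proposition 7 and Corollary 9.2 can be sanity-checked numerically. The sketch below uses the classical exact count $\frac{1}{n}\sum_{d \mid n}\mu(d)q^{n/d}$ of monic irreducible polynomials of degree $n$ (Gauss's formula, a standard fact that is not stated in the text above), compares it against a brute-force count over $\mathbf{F}_2$, and evaluates the partial sums of $1/|P|$. All function names are hypothetical.

```python
import math


def mobius(n):
    """Integer Moebius function, by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result


def count_monic_irreducibles(q, n):
    """Number of monic irreducible polynomials of degree n over F_q (Gauss's formula)."""
    total = sum(mobius(d) * q ** (n // d) for d in range(1, n + 1) if n % d == 0)
    return total // n


def gf2_mod(a, b):
    """Remainder of a modulo b in F_2[t] (bit i = coefficient of t^i)."""
    width = b.bit_length()
    while a.bit_length() >= width:
        a ^= b << (a.bit_length() - width)
    return a


def is_irreducible_gf2(f):
    """Trial division by every polynomial of degree 1..deg(f)//2."""
    degree = f.bit_length() - 1
    if degree < 1:
        return False
    return all(gf2_mod(f, d) != 0 for d in range(2, 1 << (degree // 2 + 1)))


def reciprocal_norm_sum(q, max_degree):
    """Sum of 1/|P| over monic irreducible P with deg(P) <= max_degree."""
    return sum(count_monic_irreducibles(q, n) / q ** n
               for n in range(1, max_degree + 1))
```

The exact count is $q^n/n + O(q^{n/2}/n)$, which is the monic form of Proposition 7, and `reciprocal_norm_sum(q, N)` stays within a bounded distance of `math.log(N)`, as Corollary 9.2 predicts.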

More generally, we have the following analog of Dirichlet's theorem on primes in arithmetic progressions (with a much better error term than its integer counterpart, thanks to the Riemann hypothesis for curves over a finite field):

Proposition 8 (Dirichlet's theorem for function fields). Let $a, r \in \mathbf{F}_q[t]$ be relatively prime, $\deg(a) > 0$. Let $\pi_q(N; a, r)$ be the number of irreducible polynomials of degree $N$ in $\mathbf{F}_q[t]$ which are congruent to $r$ (modulo $a$). Then $\pi_q(N; a, r) = (q - 1)\frac{q^N}{N\Phi(a)} + O\big(\frac{q^{N/2}}{N}\big)$.

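We will need the following two lemmas in our construction of the function $\tau$ in the correlation condition.

Lemma 8 (Divisor bound). Let $f \in \mathbf{F}_q[t]$. Suppose $\deg(f) = N$. Then $d(f)$, the number of divisors of $f$, satisfies $d(f) \le q^{O(N/\log N)}$.

Proof. If $f$ has the factorization $f = c\prod_{i=1}^m P_i^{\alpha_i}$, then $d(f) = \prod_{i=1}^m (\alpha_i + 1)$. Therefore, $\frac{d(f)}{|f|^\epsilon} = \prod_{i=1}^m \frac{\alpha_i + 1}{|P_i|^{\epsilon\alpha_i}}$, where $\epsilon$ is to be chosen later, possibly depending on $f$.

Note that, if $\deg(P_i) \ge 1/\epsilon$, then

$$\frac{\alpha_i + 1}{|P_i|^{\epsilon\alpha_i}} \le \frac{\alpha_i + 1}{q^{\alpha_i}} \le 1.$$

If $\deg(P_i) < 1/\epsilon$, then

$$\frac{\alpha_i + 1}{|P_i|^{\epsilon\alpha_i}} \le \frac{\alpha_i + 1}{q^{\epsilon\alpha_i}} \le \frac{1}{\epsilon}.$$

Since the second case can occur for at most $q^{1/\epsilon}$ values of $P_i$, we have $\frac{d(f)}{|f|^\epsilon} \le (1/\epsilon)^{q^{1/\epsilon}} = q^{q^{1/\epsilon}\log_q(1/\epsilon)}$. Thus

$$d(f) \le q^{N\epsilon + q^{1/\epsilon}\log_q(1/\epsilon)}$$

for every $\epsilon > 0$. If we choose $\epsilon = 1/\log N$, then we have $d(f) \le q^{O(N/\log N)}$, as required. □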
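Lemma 9. Let $S$ be a finite set of irreducible polynomials in $\mathbf{F}_q[t]$. Then for every $K$,

$$\exp\Big(K\sum_{P \in S} \frac{1}{|P|}\Big) = 1 + O_K\Big(\sum_{P \in S} \frac{\log^K |P|}{|P|}\Big).$$

This bound is perhaps surprising, since it is uniform over all finite subsets of the irreducible polynomials.

Proof. We have

$$\exp\Big(K\sum_{P \in S} \frac{1}{|P|}\Big) = 1 + \sum_{n=1}^\infty \frac{K^n}{n!} \sum_{P_1, \ldots, P_n \in S} \frac{1}{|P_1 \cdots P_n|}$$

$$\le 1 + \sum_{n=1}^\infty \frac{K^n}{n!}\, n \sum_{P \in S} \frac{1}{|P|} \sum_{\substack{P_1, \ldots, P_{n-1} \in S \\ \deg(P_i) \le \deg(P)}} \frac{1}{|P_1 \cdots P_{n-1}|}$$

$$= 1 + K\sum_{P \in S} \frac{1}{|P|} \sum_{n=1}^\infty \frac{K^{n-1}}{(n-1)!} \Big(\sum_{\substack{P' \in S \\ \deg(P') \le \deg(P)}} \frac{1}{|P'|}\Big)^{n-1}.$$

By Corollary 9.2 we have that $\sum_{P' \in S,\ \deg(P') \le \deg(P)} \frac{1}{|P'|} \le \log\deg(P) + O(1)$. Hence

$$\exp\Big(K\sum_{P \in S} \frac{1}{|P|}\Big) \le 1 + K\sum_{P \in S} \frac{1}{|P|} \sum_{n=1}^\infty \frac{K^{n-1}(\log\deg(P) + O(1))^{n-1}}{(n-1)!}$$

$$\le 1 + O_K\Big(\sum_{P \in S} \frac{1}{|P|}\exp\big(K\log(\deg(P))\big)\Big) = 1 + O_K\Big(\sum_{P \in S} \frac{\log^K |P|}{|P|}\Big),$$

as required. □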

In our proof of the Goldston-Yıldırım estimates (Propositions 9, 10, 11) in the next sections, we will be concerned with Euler products in several variables, i.e. of the form $\prod_P \big(1 - \sum_{j=1}^n \frac{c_{P,j}}{|P|^{1+s_j}}\big)$, as $\Re s_j > 0$ and $s_j \to 0$ uniformly. The following lemma gives an asymptotic formula for such Euler products.

Lemma 10. Let $P$ range over monic irreducible polynomials in $\mathbf{F}_q[t]$. For every $P$ let $c_{P,1}, \ldots, c_{P,n}$ be real numbers such that $|c_{P,j}| \le 1$ and $c_{P,j} = c_j$ for $P$ outside a finite set $S$. Let $s_1, \ldots, s_n \in \mathbf{C}$ be such that $\Re s_j > 0$ and $s_j = o(1)$ uniformly. Then

$$\prod_P \Big(1 - \sum_{j=1}^n \frac{c_{P,j}}{|P|^{1+s_j}}\Big) = G(1 + o_n(1)) \prod_{P \in S}\Big(1 + O_n\Big(\frac{1}{|P|}\Big)\Big) \prod_{j=1}^n (s_j \log q)^{c_j},$$

where $G = \prod_P \big(1 - \sum_{j=1}^n \frac{c_{P,j}}{|P|}\big)\big(1 - \frac{1}{|P|}\big)^{-\sum_{j=1}^n c_j}$.

Note that the $O_n$ and $o(1)$ depend only on $n$ and the rate at which $s_1, \ldots, s_n \to 0$, and not on the exceptional set $S$.
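Proof. Note that $\big(1 - \sum_{j=1}^n \frac{c_{P,j}}{|P|}\big)\big(1 - \frac{1}{|P|}\big)^{-\sum_{j=1}^n c_j}$ is equal to $1 + O_n\big(\frac{1}{|P|}\big)$ if $P \in S$ and $1 + O_n\big(\frac{1}{|P|^2}\big)$ if $P \notin S$. In particular the product defining $G$ converges.

Let us now look at the expression

$$\Big(1 - \sum_{j=1}^n \frac{c_{P,j}}{|P|^{1+s_j}}\Big) \prod_{j=1}^n \Big(1 - \frac{1}{|P|^{1+s_j}}\Big)^{-c_j}.$$

For $P$ outside of $S$, an easy calculation (by calculating the partial derivative of the expression with respect to each $s_j$) shows that it is equal to the corresponding factor $\big(1 - \sum_{j=1}^n \frac{c_j}{|P|}\big)\big(1 - \frac{1}{|P|}\big)^{-\sum_{j=1}^n c_j}$ of $G$, up to a multiplicative error whose product over all $P$ is $1 + o_n(1)$. For $P \in S$, we just bound it crudely by $1 + O_n\big(\frac{1}{|P|}\big)$, which is equal to $\big(1 - \sum_{j=1}^n \frac{c_{P,j}}{|P|}\big)\big(1 - \frac{1}{|P|}\big)^{-\sum_{j=1}^n c_j}\big(1 + O_n\big(\frac{1}{|P|}\big)\big)$.

Multiplying these estimates over all $P$, we have

$$\prod_P \Big(1 - \sum_{j=1}^n \frac{c_{P,j}}{|P|^{1+s_j}}\Big) = G(1 + o_n(1)) \prod_{j=1}^n \zeta_q(1 + s_j)^{-c_j} \prod_{P \in S}\Big(1 + O_n\Big(\frac{1}{|P|}\Big)\Big).$$

Writing out $\zeta_q(1 + s_j)^{-1} = 1 - q^{-s_j} = (1 + o(1))s_j \log q$, we have the desired estimate. □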



9. A PSEUDORANDOM MEASURE THAT MAJORIZES THE IRREDUCIBLE POLYNOMIALS 

In this section we prove Theorem 5 by constructing a pseudorandom measure $\nu$. The proof of its pseudorandomness is however deferred to the next two sections. Recall that our task is to find a pseudorandom measure $\nu$ such that $\nu$ majorizes a function $\phi$ which is supported on $A$ and such that $\mathbf{E}(\phi \mid \mathbf{F}_{q^N}) \ge \delta$, where $\bar{d}_{\mathcal{P}}(A) > 0$, and $\delta$ is a positive constant depending on $\bar{d}_{\mathcal{P}}(A)$ alone. Throughout this whole section and the next two, polynomials denoted by the letter $d$ (such as $d$, $d'$ or $d_i$) will stand for monic polynomials.

Let's fix once and for all

• $R = \alpha N$, where $\alpha$ is a small constant depending only on $k$.

• $w = w(N)$, a function tending sufficiently slowly to infinity. We may take $w(N) \ll \log N$.

• $W = \prod_{\deg(P) < w} P$. Since $W$ divides $\prod_{n < w}(t^{q^n} - t)$, we have $\deg(W) \le \sum_{n < w} q^n < q^w$, so that $\deg(W) \ll N$. We will see that eventually we can take $w$ to be a sufficiently large number, hence $W$ to be a sufficiently large polynomial.

• $\chi : \mathbf{R} \to \mathbf{R}$ a smooth function supported on $[-1, 1]$ such that $\chi(0) > 0$ and $\int_0^1 (\chi'(x))^2\,dx = 1$.

• $\Lambda_R(f) = \sum_{d \mid f,\ \deg(d) \le R} \mu(d)\chi\big(\frac{\deg(d)}{R}\big)$, the Goldston-Yıldırım divisor sum.

• $\nu(f) = \nu_b(f) = R\,\frac{\Phi(W)}{|W|}\,\Lambda_R(Wf + b)^2$ for some appropriate $b$ such that $\deg(b) < \deg(W)$, $\gcd(b, W) = 1$, to be chosen later.
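To make the divisor sum $\Lambda_R$ concrete, here is a minimal sketch over $\mathbf{F}_2$, with polynomials encoded as integer bitmasks (bit $i$ is the coefficient of $t^i$). The cutoff `chi` below is one possible smooth bump supported on $[-1,1]$ with $\chi(0) > 0$; it is an assumption for illustration and is not normalized so that $\int_0^1 (\chi')^2 = 1$. All function names are hypothetical.

```python
import math


def gf2_divmod(a, b):
    """Quotient and remainder of a by b in F_2[t] (bit i = coefficient of t^i)."""
    quotient, width = 0, b.bit_length()
    while a.bit_length() >= width:
        shift = a.bit_length() - width
        quotient ^= 1 << shift
        a ^= b << shift
    return quotient, a


def poly_mobius(d):
    """Moebius function on F_2[t]: (-1)^m if d is squarefree with m prime factors, else 0."""
    sign, p = 1, 2  # p = 2 encodes the polynomial t
    while d.bit_length() > 1:
        if 2 * p.bit_length() - 1 > d.bit_length():
            return -sign  # the remaining cofactor is irreducible
        quotient, remainder = gf2_divmod(d, p)
        if remainder == 0:
            d = quotient
            if gf2_divmod(d, p)[1] == 0:
                return 0  # p^2 divides the original polynomial
            sign = -sign
        p += 1
    return sign


def chi(x):
    """A smooth bump supported on [-1, 1] with chi(0) > 0 (not normalized)."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1 else 0.0


def goldston_yildirim_sum(f, R):
    """Lambda_R(f) = sum over divisors d of f with deg(d) <= R of mu(d) chi(deg(d)/R)."""
    total = 0.0
    for d in range(1, f + 1):
        degree = d.bit_length() - 1
        if degree <= R and gf2_divmod(f, d)[1] == 0:
            total += poly_mobius(d) * chi(degree / R)
    return total
```

If $f$ is irreducible with $\deg(f) > R$, the only divisor of degree at most $R$ is $d = 1$, so the sum collapses to $\chi(0)$; this is the observation that opens the proof of Theorem 5.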



Proof of Theorem 5 under the assumption that $\nu$ is pseudorandom. Notice that if $f$ is irreducible and $\deg(f) > R$ then $\Lambda_R(f) = \chi(0)$. For $f \in G_N$ let

$$\phi(f) = \phi_b(f) = \begin{cases} \chi(0)^2\,\frac{\Phi(W)}{|W|}\,R, & \text{if } Wf + b \text{ is irreducible and } \deg(Wf + b) > R; \\ 0, & \text{otherwise.} \end{cases}$$

Then clearly $0 \le \phi \le \nu$, and $\phi$ is bounded pointwise by $\chi(0)^2\,\frac{\Phi(W)}{|W|}\,R$. Thus it suffices to find $b$ such that $\sum_{f \in G_N} \phi_b(f) 1_{Wf + b \in A} \ge \delta q^N$ for some constant $\delta > 0$.

Let us take the sum $\sum_{f \in G_N} \phi_b(f) 1_{Wf + b \in A}$ over all $b$ such that $\deg(b) < \deg(W)$, $\gcd(b, W) = 1$. It is easy to see that it is equal to

$$\#\{h \in A : R < \deg(h) < N + \deg(W)\} \cdot \chi(0)^2\,\frac{\Phi(W)}{|W|}\,R.$$

By the prime number theorem in $\mathbf{F}_q[t]$ (Proposition 7),

$$\#\{h \in \mathcal{P} : R < \deg(h) < N + \deg(W)\} = (1 + o(1))\frac{q^N |W|}{N + \deg(W)}.$$

Since $N + \deg(W)$ increases at most linearly in $N$, and since $\bar{d}_{\mathcal{P}}(A) > 0$, we conclude that there is a constant $\delta > 0$ depending only on $A$ such that $\sum_b \sum_{f \in G_N} \phi_b(f) 1_{Wf + b \in A} \ge \delta \Phi(W) q^N$ infinitely often. Thus, for infinitely many $N$, we can find $b$ such that $\sum_{f \in G_N} \phi_b(f) 1_{Wf + b \in A} \ge \delta q^N$, as required. □

From now on let us assume without loss of generality that $b = 1$. (Note that if $A = \mathcal{P}$ we can always take $b = 1$, thanks to Dirichlet's theorem in $\mathbf{F}_q[t]$ (Proposition 8).)



The introduction of $W$, alluded to earlier as the $W$-trick, is meant to absorb small irreducible polynomials arising in the linear forms condition. Except for this technical reason, for the most part we can go through the arguments pretending that $W = 1$ without losing the general idea.

Goldston-Yıldırım used a truncated sum corresponding to $\chi(x) = \max(1 - |x|, 0)$. As observed by Tao [23], the use of a smooth function allows us to perform Fourier analysis.

This is always true if the limit $d_{\mathcal{P}}(A) = \lim_{N \to \infty} \frac{\#\{f \in A : \deg(f) < N\}}{\#\{f \in \mathcal{P} : \deg(f) < N\}}$ exists. If not, then this can be false if we allow $w$ to tend to infinity. However, as already mentioned earlier, we can eventually take $w$ to be a constant, so that the argument remains valid.

The only thing missing from the conclusions of Theorem 5 is to check that $\nu$ is indeed a pseudorandom measure, i.e. it satisfies the linear forms condition and the correlation condition. This will be done in the next two sections. In order to do so, we will need estimates on sums of the form $\sum \Lambda_R(\psi_1) \cdots \Lambda_R(\psi_n)$ where the $\psi_i$ are linear forms. The following proposition shows us how to deal with sums of this kind.

Proposition 9 (Goldston-Yıldırım estimates). Let $J_1, \ldots, J_n \in \mathbf{F}_q[t]$ be given, not necessarily distinct. Let $r$ be the number of distinct elements in $\{J_1, \ldots, J_n\}$. Also, for every monic, irreducible $P \in \mathbf{F}_q[t]$, let $\alpha_P$ be the number of distinct residue classes modulo $P$ occupied by $J_1, \ldots, J_n$. Put $\Delta = \Delta(J_1, \ldots, J_n) = \prod (J_i - J_{i'})$, where the product is taken over all couples $(J_i, J_{i'})$ such that $J_i \ne J_{i'}$. Then as $N \to \infty$,

$$\sum_{f \in G_N} \Lambda_R(f + J_1) \cdots \Lambda_R(f + J_n) = CGH(1 + o_n(1))\,q^N \Big(\frac{\log q}{R}\Big)^r \qquad (3)$$

where $C$ is a computable constant (not depending on $N, J_1, \ldots, J_n$ but only on the multiplicities of the $J_i$ and on $\chi$), $G$ is the "arithmetic factor" $\prod_P \big(1 - \frac{\alpha_P}{|P|}\big)\big(1 - \frac{1}{|P|}\big)^{-r}$, and $H = \prod_{P \mid \Delta}\big(1 + O_n\big(\frac{1}{|P|}\big)\big)$.

Remarks 9.1. This Proposition illustrates how the Goldston-Yıldırım method works. We will not apply this Proposition directly (since we will be incorporating the $W$-trick), but rather its variants (Propositions 10 and 11), for which only minor modifications are needed.



Proof. Writing out the definition of $\Lambda_R$, we see that the left hand side of (3) is

$$\sum_{d_1, \ldots, d_n \in G_N} \Big(\prod_{i=1}^n \mu(d_i)\chi\Big(\frac{\deg(d_i)}{R}\Big)\Big) \sum_{f \in G_N} 1_{d_i \mid f + J_i\ \forall i = 1, \ldots, n} \qquad (4)$$

Note that since $\chi$ is supported on $[-1, 1]$, the summation over $d_1, \ldots, d_n \in G_N$ is the same as the summation over $d_1, \ldots, d_n \in G_R$. Also, because of the appearance of the function $\mu$, only squarefree $d_i$ are involved.

Suppose $R$ is sufficiently small compared to $N$, say $nR < N$. Then for every $d_1, \ldots, d_n \in G_R$, we have $|[d_1, \ldots, d_n]| < q^N$. Therefore

$$\sum_{f \in G_N} 1_{d_i \mid f + J_i\ \forall i = 1, \ldots, n} = \frac{g(d_1, \ldots, d_n)}{|[d_1, \ldots, d_n]|}\,q^N \qquad (5)$$

where $g(d_1, \ldots, d_n)$ is the number of solutions in $G_{\deg([d_1, \ldots, d_n])}$ of the system of congruences $f + J_i \equiv 0 \pmod{d_i}$ for every $i = 1, \ldots, n$. We will come back to the analysis of $g(d_1, \ldots, d_n)$ later.

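We can now rewrite the expression (4) as

$$q^N \sum_{d_1, \ldots, d_n} \frac{g(d_1, \ldots, d_n)}{|[d_1, \ldots, d_n]|} \prod_{i=1}^n \mu(d_i)\chi\Big(\frac{\deg(d_i)}{R}\Big) \qquad (6)$$

Write $q^x\chi(x) = \int_{-\infty}^{\infty} \psi(t)q^{-ixt}\,dt$ (in other words, $\psi(t) = \frac{\log q}{2\pi}\int_{-\infty}^{\infty} q^x\chi(x)q^{ixt}\,dx$), where $\psi$ is rapidly decreasing, i.e. $\psi(t) = O_A((1 + |t|)^{-A})$ for every $A$. Thus for every $m = 1, \ldots, n$,

$$\chi\Big(\frac{\deg(d_m)}{R}\Big) = \int_{-\infty}^{\infty} |d_m|^{-\frac{1 + it_m}{R}}\psi(t_m)\,dt_m = \int_{-\sqrt{R}}^{\sqrt{R}} |d_m|^{-\frac{1 + it_m}{R}}\psi(t_m)\,dt_m + O_A\big(|d_m|^{-1/R}R^{-A}\big).$$

We split the expression (6) into a main term of

$$q^N \sum_{d_1, \ldots, d_n} \frac{g(d_1, \ldots, d_n)}{|[d_1, \ldots, d_n]|} \prod_{m=1}^n \mu(d_m) \int_{-\sqrt{R}}^{\sqrt{R}} |d_m|^{-\frac{1 + it_m}{R}}\psi(t_m)\,dt_m$$

plus an error term, which is

$$\ll_A q^N R^{-A} \sum_{d_1, \ldots, d_n \text{ squarefree}} \frac{g(d_1, \ldots, d_n)}{|[d_1, \ldots, d_n]|}\,|d_1 \cdots d_n|^{-1/R} \qquad (7)$$

Note that, since $0 \le g(d_1, \ldots, d_n) \le 1$,

$$\sum_{d_1, \ldots, d_n \text{ squarefree}} \frac{g(d_1, \ldots, d_n)}{|[d_1, \ldots, d_n]|}\,|d_1 \cdots d_n|^{-1/R} \le \prod_P \Big(1 + \frac{2^n - 1}{|P|^{1 + 1/R}}\Big) = \prod_P \Big(1 - \frac{1}{|P|^{1 + 1/R}}\Big)^{-(2^n - 1)}\Big(1 + O_n\Big(\frac{1}{|P|^2}\Big)\Big).$$

Note that the product $\prod_P \big(1 + O_n\big(\frac{1}{|P|^2}\big)\big)$ is absolutely convergent, while $\prod_P \big(1 - \frac{1}{|P|^{1 + 1/R}}\big)^{-1} = \zeta_q(1 + 1/R) = \frac{1}{1 - q^{-1/R}} = O(R)$. Thus by choosing $A$ sufficiently large depending on $n$, we see that the error term (7) is $o(q^N R^{-n})$, which is negligible compared with the right hand side of (3). In particular the sum in (6) converges absolutely.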
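Therefore, it suffices to show that

$$\sum_{d_1, \ldots, d_n} \frac{g(d_1, \ldots, d_n)}{|[d_1, \ldots, d_n]|} \prod_{m=1}^n \mu(d_m) \int_{-\sqrt{R}}^{\sqrt{R}} |d_m|^{-\frac{1 + it_m}{R}}\psi(t_m)\,dt_m = CGH(1 + o_n(1))\Big(\frac{\log q}{R}\Big)^r.$$

Note that all of our expressions are in terms of $R$, and we have eliminated the role of $N$. We now switch the orders of the sums and the integral (which is legitimate since the sum is absolutely convergent) and get

$$\int_{-\sqrt{R}}^{\sqrt{R}} \cdots \int_{-\sqrt{R}}^{\sqrt{R}} \Big(\sum_{d_1, \ldots, d_n} \frac{g(d_1, \ldots, d_n)}{|[d_1, \ldots, d_n]|} \prod_{m=1}^n \mu(d_m)|d_m|^{-\frac{1 + it_m}{R}}\Big) \prod_{m=1}^n \psi(t_m)\,dt_1 \cdots dt_n \qquad (8)$$

(Footnote to the identity $q^x\chi(x) = \int \psi(t)q^{-ixt}\,dt$: the appearance of $q$ here is for mere aesthetic reasons.)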

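Let us estimate the expression under the integration.

Lemma 11. For every $I \subset \{1, \ldots, n\}$, $I \ne \emptyset$, let $c_I = 1$ if $J_i = J_{i'}$ for every $i, i' \in I$, and $c_I = 0$ otherwise. Then for every $t_1, \ldots, t_n \in [-\sqrt{R}, \sqrt{R}]$ we have

$$\sum_{d_1, \ldots, d_n} \frac{g(d_1, \ldots, d_n)}{|[d_1, \ldots, d_n]|} \prod_{m=1}^n \mu(d_m)|d_m|^{-\frac{1 + it_m}{R}} = GH(1 + o_{n, R \to \infty}(1)) \prod_{I \subset \{1, \ldots, n\},\ I \ne \emptyset} \Big(\frac{\log q}{R}\sum_{m \in I}(1 + it_m)\Big)^{(-1)^{|I|+1}c_I} \qquad (9)$$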

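Proof. Recall that in the expression on the left hand side of (9), only squarefree $d_1, \ldots, d_n$ are involved. We have that $g(d_1, \ldots, d_n)$ always takes one of the two values 0 and 1. More precisely, by the Chinese remainder theorem, $g(d_1, \ldots, d_n) = \prod_P c_{P, \{i : P \mid d_i\}}$, where the $c_{P,I}$ are "local factors" defined by $c_{P,I} = \#\{\deg(f) < \deg(P) : P \mid f + J_i \text{ for every } i \in I\}$ for every $I \subset \{1, \ldots, n\}$, $I \ne \emptyset$ (and $c_{P,\emptyset} = 1$). We have the following explicit formula:

$$c_{P,I} = \begin{cases} 1, & \text{if } J_i \equiv J_{i'} \pmod{P} \text{ for every } i, i' \in I; \\ 0, & \text{otherwise.} \end{cases}$$

Let $S$ be the set of all irreducible divisors of $\Delta$. Then for $P$ outside of $S$, we have $c_{P,I} = c_I$. The left hand side of (9) can be factored as

$$\prod_P \Big(1 - \sum_{I \subset \{1, \ldots, n\},\ I \ne \emptyset} \frac{(-1)^{|I|+1}c_{P,I}}{|P|^{1 + \frac{1}{R}\sum_{m \in I}(1 + it_m)}}\Big)$$

which is an Euler product treated in Lemma 10. We know from Lemma 10 that it is equal to

$$GH(1 + o_{R \to \infty}(1)) \prod_{I \subset \{1, \ldots, n\},\ I \ne \emptyset} \Big(\frac{\log q}{R}\sum_{m \in I}(1 + it_m)\Big)^{(-1)^{|I|+1}c_I}$$

where $G$ is the arithmetic factor

$$G = \prod_P \Big(1 - \sum_{I \ne \emptyset} \frac{(-1)^{|I|+1}c_{P,I}}{|P|}\Big)\Big(1 - \frac{1}{|P|}\Big)^{-\sum_{I \ne \emptyset}(-1)^{|I|+1}c_I}.$$

Let us verify that this is indeed the same expression for $G$ claimed at the beginning.

Claim 5. $\sum_{I \subset \{1, \ldots, n\},\ I \ne \emptyset} (-1)^{|I|}c_I = -r$.

Indeed, if $a_1, \ldots, a_r$ are the multiplicities of $J_1, \ldots, J_n$, then

$$\sum_{I \subset \{1, \ldots, n\},\ I \ne \emptyset} (-1)^{|I|}c_I = \sum_{s=1}^r \sum_{j=1}^{a_s} (-1)^j \binom{a_s}{j} = -r.$$

Claim 6. $\sum_{I \subset \{1, \ldots, n\},\ I \ne \emptyset} (-1)^{|I|}c_{P,I} = -\alpha_P$.

This follows from exactly the same observation as the previous claim. Thus $G = \prod_P \big(1 - \frac{\alpha_P}{|P|}\big)\big(1 - \frac{1}{|P|}\big)^{-r}$ and the lemma follows. □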
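Claims 5 and 6 are finite combinatorial identities and can be checked by direct enumeration. The sketch below sums $(-1)^{|I|}$ over all non-empty index sets $I$ on which a given list is constant; applied to the shifts themselves it should return $-r$, and applied to their residues modulo a prime it should return $-\alpha_P$. The function name is hypothetical, and integers stand in for the polynomials $J_i$ purely for illustration.

```python
from itertools import combinations


def signed_constant_subset_sum(values):
    """Sum of (-1)^{|I|} over non-empty index sets I on which `values` is constant."""
    n = len(values)
    total = 0
    for size in range(1, n + 1):
        for indices in combinations(range(n), size):
            if len({values[i] for i in indices}) == 1:
                total += (-1) ** size
    return total
```

For example, the list `[1, 4, 7, 2]` has $r = 4$ distinct entries, while its residues modulo 3 occupy only $\alpha = 2$ classes, so the two sums are $-4$ and $-2$ respectively.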



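By integrating over all $t_1, \ldots, t_n \in [-\sqrt{R}, \sqrt{R}]$, we see that the expression in (8) is equal to

$$\Big(\frac{\log q}{R}\Big)^r GH(1 + o(1)) \int_{-\sqrt{R}}^{\sqrt{R}} \cdots \int_{-\sqrt{R}}^{\sqrt{R}} \prod_{I \subset \{1, \ldots, n\},\ I \ne \emptyset} \Big(\sum_{m \in I}(1 + it_m)\Big)^{(-1)^{|I|+1}c_I} \prod_{m=1}^n \psi(t_m)\,dt_1 \cdots dt_n$$

$$= \Big(\frac{\log q}{R}\Big)^r GH(1 + o(1)) \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} \prod_{I \subset \{1, \ldots, n\},\ I \ne \emptyset} \Big(\sum_{m \in I}(1 + it_m)\Big)^{(-1)^{|I|+1}c_I} \prod_{m=1}^n \psi(t_m)\,dt_1 \cdots dt_n$$

since $\psi$ decreases rapidly. Thus we have proved the estimate (3), with

$$C = \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} \prod_{I \subset \{1, \ldots, n\},\ I \ne \emptyset} \Big(\sum_{m \in I}(1 + it_m)\Big)^{(-1)^{|I|+1}c_I} \prod_{m=1}^n \psi(t_m)\,dt_1 \cdots dt_n.$$

This expression can be simplified a little bit. Let $a_1, \ldots, a_r$ be the multiplicities of $J_1, \ldots, J_n$; then we have $C = \prod_{s=1}^r C_{a_s}$, where

$$C_a = \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} \prod_{I \subset \{1, \ldots, a\},\ I \ne \emptyset} \Big(\sum_{m \in I}(1 + it_m)\Big)^{(-1)^{|I|+1}} \prod_{m=1}^a \psi(t_m)\,dt_1 \cdots dt_a \qquad (10)$$

In our applications we will be able to compute these constants explicitly in terms of $\chi$. □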



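10. THE CORRELATION CONDITION

In this section we prove the correlation condition. As mentioned before, we will need a variant of Proposition 9.

Proposition 10. Suppose $J_1, J_2, \ldots, J_n \in \mathbf{F}_q[t]$, not necessarily distinct. Let $\Delta = \Delta(J_1, \ldots, J_n)$, $r$ and $\alpha_P$ be defined as before in Proposition 9. Then we have the asymptotic formula

$$\sum_{f \in G_N} \Lambda_R(W(f + J_1) + 1) \cdots \Lambda_R(W(f + J_n) + 1) = CGH(1 + o_n(1))\,q^N \Big(\frac{\log q}{R}\Big)^r \qquad (11)$$

where $G = \prod_{\deg(P) < w}\big(1 - \frac{1}{|P|}\big)^{-r} \prod_{\deg(P) \ge w}\big(1 - \frac{\alpha_P}{|P|}\big)\big(1 - \frac{1}{|P|}\big)^{-r}$, $H = \prod_{P \mid \Delta}\big(1 + O_n\big(\frac{1}{|P|}\big)\big)$, and $C$ is the same constant as in Proposition 9.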



Proof. The proof follows along the lines of that of Proposition 9. The only difference is that we have a different formula for the local factors $c_{P,I} = \#\{\deg(f) < \deg(P) : P \mid W(f + J_i) + 1\ \forall i \in I\}$. For any $I \subset \{1, \ldots, n\}$, $I \ne \emptyset$, we have

$$c_{P,I} = \begin{cases} 1, & \text{if } \deg(P) \ge w \text{ and } J_i \equiv J_{i'} \pmod{P} \text{ for every } i, i' \in I; \\ 0, & \text{otherwise.} \end{cases}$$

By incorporating this change into the proof, we will find the desired expression for $G$. □

Proof of the correlation condition. We are interested in expressions of the form

$$\mathbf{E}\big(\nu(f + h_1) \cdots \nu(f + h_l) \mid f \in G_N\big)$$

where $h_1, \ldots, h_l \in G_N$ and the number of forms $l$ is bounded by $l_0$, which depends only on $k$. Recall that our goal is to find a function $\tau$ on $G_N$ such that

$$\mathbf{E}\big(\nu(f + h_1) \cdots \nu(f + h_l) \mid f \in G_N\big) \le \sum_{1 \le i < j \le l} \tau(h_i - h_j) \qquad (12)$$

Moreover, for every $1 \le p < \infty$,

$$\mathbf{E}\big(\tau(f)^p \mid f \in G_N\big) = O_p(1) \qquad (13)$$

In the event where two of the $h_i$ are equal, we bound $\mathbf{E}\big(\nu(f + h_1) \cdots \nu(f + h_l) \mid f \in G_N\big)$ by $\|\nu\|_\infty^{l_0} = q^{O(N/\log N)}$, thanks to Lemma 8. By choosing $\tau(0) = q^{O(N/\log N)}$, the inequality (12) is then clearly satisfied. Moreover, since $q^{O(N/\log N)} = O_\epsilon(q^{\epsilon N})$ for every $\epsilon > 0$, the addition of $q^{O(N/\log N)}$ to $\tau(0)$ does not affect the boundedness of $\mathbf{E}(\tau^p)$ for every $p \ge 1$.

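Therefore, we have to find a function $\tau$ with $\mathbf{E}(\tau^p) = O_p(1)$ for every $p \ge 1$ so that the inequality (12) is satisfied when all the $h_i$ are distinct. From the definition of $\nu$, we have

$$\mathbf{E}\big(\nu(f + h_1) \cdots \nu(f + h_l) \mid f \in G_N\big) = R^l \Big(\frac{\Phi(W)}{|W|}\Big)^l \mathbf{E}\big(\Lambda_R(W(f + h_1) + 1)^2 \cdots \Lambda_R(W(f + h_l) + 1)^2 \mid f \in G_N\big).$$

Thanks to Proposition 10 we know that

$$\mathbf{E}\big(\Lambda_R(W(f + h_1) + 1)^2 \cdots \Lambda_R(W(f + h_l) + 1)^2 \mid f \in G_N\big) = CGH(1 + o(1))\Big(\frac{\log q}{R}\Big)^l$$

where

$$G = \prod_{\deg(P) < w}\Big(1 - \frac{1}{|P|}\Big)^{-l} \prod_{\deg(P) \ge w}\Big(1 - \frac{l}{|P|}\Big)\Big(1 - \frac{1}{|P|}\Big)^{-l}$$

(the factors with $P \mid \Delta$, for which $\alpha_P < l$, being absorbed into $H$). Since $\prod_P \big(1 - \frac{l}{|P|}\big)\big(1 - \frac{1}{|P|}\big)^{-l} = \prod_P \big(1 + O_l\big(\frac{1}{|P|^2}\big)\big)$ converges absolutely, and since $w \to \infty$, we have that $G = \big(\frac{|W|}{\Phi(W)}\big)^l (1 + o(1))$.

Let us compute $C$ explicitly. In this case, each $h_i$ has multiplicity 2, hence equation (10) gives us $C = C_2^l$, where

$$C_2 = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \frac{(1 + it_1)(1 + it_2)}{2 + it_1 + it_2}\,\psi(t_1)\psi(t_2)\,dt_1\,dt_2$$

$$= \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} (1 + it_1)(1 + it_2)\psi(t_1)\psi(t_2)\Big(\log q \int_0^\infty q^{-(2 + it_1 + it_2)x}\,dx\Big)\,dt_1\,dt_2$$

$$= \log q \int_0^\infty \Big(\int_{-\infty}^{\infty} (1 + it)\psi(t)q^{-(1 + it)x}\,dt\Big)^2 dx = \log q \int_0^\infty \Big(\frac{\chi'(x)}{\log q}\Big)^2 dx = (\log q)^{-1},$$

so that $C = (\log q)^{-l}$. Therefore,

$$\mathbf{E}\big(\nu(f + h_1) \cdots \nu(f + h_l) \mid f \in G_N\big) = (1 + o(1))H.$$

Recall that $H = \prod_{P \mid \Delta}\big(1 + O\big(\frac{1}{|P|}\big)\big)$, where $\Delta = \prod_{1 \le i < j \le l}(h_i - h_j)$. Let us bound $(1 + o(1))H$ by $\exp\big(M\sum_{P \mid \Delta} \frac{1}{|P|}\big)$, where $M$ is a constant depending only on $l_0$, hence on $k$.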



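For $f \ne 0$, put $\tau(f) = \exp\big(K\sum_{P \mid f} \frac{1}{|P|}\big)$ for $K$ sufficiently large depending on $M$; then clearly the inequality (12) is satisfied. The only thing left to verify is the inequality (13). By Lemma 9 we have

$$\mathbf{E}\big(\tau(f)^p \mid G_N\big) = \mathbf{E}\Big(\exp\Big(pK\sum_{P \mid f} \frac{1}{|P|}\Big) \Bigm| f \in G_N\Big) \ll 1 + \mathbf{E}\Big(\sum_{P \mid f} \frac{\log^{pK}|P|}{|P|} \Bigm| f \in G_N\Big).$$

For every $P$, the number of $f \in G_N$ that are divisible by $P$ is at most $\frac{q^N}{|P|}$. Thus

$$\mathbf{E}\big(\tau(f)^p \mid G_N\big) \ll 1 + \sum_P \frac{\log^{pK}|P|}{|P|^2} = O(1),$$

as required. □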



11. THE LINEAR FORMS CONDITION

In this section we prove the linear forms condition. For this condition, the following variant of Proposition 9 is needed:

Proposition 11. Let $\psi_1, \ldots, \psi_m$ be $m$ non-zero linear forms in $n$ variables $(G_N)^n \to \mathbf{F}_q[t]$, not necessarily distinct, of the form

$$\psi_i(\mathbf{f}) = \sum_{j=1}^n L_{ij}f_j + b_i$$

for every $\mathbf{f} = (f_1, \ldots, f_n) \in (G_N)^n$, where

• $m \le m_0$, $n \le n_0$, where $m_0, n_0$ are constants depending on $k$.

• The coefficients $L_{ij}, b_i \in \mathbf{F}_q[t]$ and $\deg(L_{ij}) < w/2$ for every $i, j$.

• For any two $i, i' = 1, \ldots, m$, either the two vectors $(L_{ij})_{j=1,\ldots,n}$ and $(L_{i'j})_{j=1,\ldots,n}$ are not proportional (over $\mathbf{F}_q(t)$), or they are identical and $\psi_i = \psi_{i'}$.

Let $r$ be the number of distinct forms in $\psi_1, \ldots, \psi_m$. Then (assuming that $R = \alpha N$ and $\alpha$ is sufficiently small depending only on $m_0, n_0$) we have the following asymptotic formula as $N \to \infty$:

$$\sum_{\mathbf{f} \in (G_N)^n} \Lambda_R(W\psi_1(\mathbf{f}) + 1) \cdots \Lambda_R(W\psi_m(\mathbf{f}) + 1) = C(1 + o(1))\Big(\frac{|W|}{\Phi(W)}\Big)^r \Big(\frac{\log q}{R}\Big)^r q^{Nn} \qquad (14)$$

where $C$ is a computable constant (depending only on $\chi$ and the multiplicities of the $\psi_i$).



Proof. The proof is similar to that of Proposition 9. The left hand side of (14) is equal to

$$q^{Nn} \sum_{d_1, \ldots, d_m} \Big(\prod_{i=1}^m \mu(d_i)\chi\Big(\frac{\deg(d_i)}{R}\Big)\Big) \frac{g(d_1, \ldots, d_m)}{|[d_1, \ldots, d_m]|^n}$$

where $g(d_1, \ldots, d_m)$ is the number of solutions $\mathbf{f} \in (G_{\deg([d_1, \ldots, d_m])})^n$ to the system of congruences $d_i \mid W\psi_i(\mathbf{f}) + 1$ for every $i = 1, \ldots, m$.

Again, by the Chinese Remainder Theorem, $g(d_1, \ldots, d_m)$ factors as $\prod_P c_{P, \{i : P \mid d_i\}}$, where the local factors $c_{P,I}$ are defined by $c_{P,I} = \#\{\mathbf{f} \in (G_{\deg(P)})^n : P \mid W\psi_i(\mathbf{f}) + 1 \text{ for all } i \in I\}$ for every $I \subset \{1, \ldots, m\}$, $I \ne \emptyset$.

Clearly if $\deg(P) < w$ then $c_{P,I} = 0$. Let us compute $c_{P,I}$ when $\deg(P) \ge w$. In particular $\gcd(P, W) = 1$. The system of congruences $P \mid W\psi_i(\mathbf{f}) + 1$ for all $i \in I$ amounts to a system of $|I|$ equations $\psi_i(\mathbf{f}) = -W^{-1}$ for every $i \in I$, where the $\psi_i$ are now regarded as affine maps $(\mathbf{F}_P)^n \to \mathbf{F}_P$, where $\mathbf{F}_P = \mathbf{F}_q[t]/(P)$.

Note that when regarded as forms on $\mathbf{F}_P$, the property that for any two forms $\psi_i, \psi_{i'}$, either their homogeneous parts are not proportional or they are identical, is still preserved. Indeed, this is obviously true if $n = 1$. Suppose $n \ge 2$ and we have that for some $i, i'$, $L_{ij}L_{i'j'} = L_{i'j}L_{ij'}$ in $\mathbf{F}_P$ for any $j, j' = 1, \ldots, n$. Then $P$ divides $L_{ij}L_{i'j'} - L_{i'j}L_{ij'}$. Since $\deg(L_{ij}) < w/2$ for every $i, j$, this means that $L_{ij}L_{i'j'} - L_{i'j}L_{ij'} = 0$. Therefore, the two vectors $(L_{ij})_{j=1,\ldots,n}$ and $(L_{i'j})_{j=1,\ldots,n}$ are indeed proportional over $\mathbf{F}_q(t)$, so that the forms $\psi_i$ and $\psi_{i'}$ are identical.

Being the number of solutions to a system of non-trivial linear equations over Fp, cpj is either 
or a power of |P| not exceeding By the assumption made on the ipi, cpj = 

if and only if all the forms in / are identical. Otherwise, cpj = 0(|P|"~^). 
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Incorporating this change, we have that 



By the above computation, this is equal to 



deg(P)>ui 



(1+°™ n I'-E , ':ri. 



deg(P)>'U) 

where the sums are taken over all non-empty subsets $I$ of $\{1, \dots, m\}$ such that the $\psi_i$, $i \in I$, are
all identical. We now understand the use of the $W$-trick: it helps absorb the terms $O(1/|P|^2)$,
which in turn come from congruence relations between coefficients of the $\psi_i$. Invoking Lemma
10, we see that the above product is equal to

_ (-1)1^1+1 

where G is the arithmetic factor G = ndeg{P)>to + ^~[p\ ' ) (-'- ~ \p\j ' ' prod- 
uct being taken over all non-empty subsets / of {1, . . . ,m} such that the ipiji G /, are all 
identical. It is easy to see that X]/(~l)l^l = so that G = {1 + o(l)) ^ "^^^ 

We then proceed as in the proof of Proposition 9 and see that



fe(Gjv)" 
where 



A«(TyV'i(/) + l)---ApWi(/) + l) = C(l + o(l))(^) '(^) 



— r 

..Nn 



(-1)1 

m 



^1 + 1 

/oo poo I \ '"■ 

•••/ n iiiJit,)dti---dt„ 
■^-'^ I \jei J j=i 

the product being taken over all non-empty subsets $I$ of $\{1, \dots, m\}$ such that the $\psi_i$, $i \in I$,
are all identical. Similarly to the constant $C$ in Proposition 9, $C$ factors as $\prod_{s=1}^{r} C_{a_s}$, where
$a_1, \dots, a_r$ are the multiplicities of the $\psi_i$, and



(-1) 



/oo poo I \ 

-oo ''-'^ Ic{l,...,a},I^fD J ra=\ 

□ 



Proof of the linear forms condition. We are interested in expressions of the form 

$E\big(\nu(\psi_1(\mathbf{f})) \cdots \nu(\psi_m(\mathbf{f})) \mid \mathbf{f} \in (F_{q^N})^n\big)$

where each $\psi_i = \sum_{j=1}^{n} L_{ij} f_j + b_i$ is a linear form in $n$ variables, where $m \le m_0$, $n \le n_0$,
no two homogeneous parts are proportional, and the coefficients $L_{ij}$ are in the set $\{P/Q : \deg(P), \deg(Q) < k\}$. Recall that we want to bound these expressions by $1 + o(1)$.

By the definition of $\nu$, this expression is equal to

(7^) " {^RiWM^) + !)'••• MWiJmif) + l)'|f G (F,iv)") (15) 

Our first reduction is to replace the assumption that all the coefficients $L_{ij}$ are in $\{P/Q : P, Q \in G_k\}$
by $L_{ij} \in G_M$ for some sufficiently large $M$ depending on $k$. Indeed, via the change of variables
$\mathbf{f} \mapsto \big(\prod_{h \in G_k,\ h \text{ monic}} h\big)\mathbf{f}$, the $\psi_i$ become linear forms with coefficients in $F_q[t]$ and of degrees
still bounded by $\deg\big(\prod_{h \in G_k,\ h \text{ monic}} h\big) + k = \sum_{d=0}^{k-1} d q^d + k = M$.
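
The degree count behind $M$ uses the fact that there are $q^d$ monic polynomials of degree $d$. The sketch below checks the resulting sum by direct enumeration; the function names and the sample values of q and k are illustrative assumptions, and the elements of F_q are only used as labels for coefficients.

```python
from itertools import product

def degree_of_monic_product(q, k):
    """Sum of deg(h) over all monic h in F_q[t] with deg(h) < k, by enumeration."""
    total = 0
    for d in range(k):
        # A monic polynomial of degree d has d free lower-order coefficients.
        for _lower_coefficients in product(range(q), repeat=d):
            total += d
    return total

def closed_form(q, k):
    """The sum over d = 0, ..., k-1 of d * q^d."""
    return sum(d * q ** d for d in range(k))

q, k = 3, 3
enumerated = degree_of_monic_product(q, k)
M = closed_form(q, k) + k
print(enumerated, closed_form(q, k), M)
```

For q = 3 and k = 3 both counts give 21, so the bound on the degrees of the new coefficients would be M = 24 in that case.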



We are tempted to apply Proposition 11 right away. However, a priori the $\psi_i$ are linear forms
from $(F_{q^N})^n$ to $F_{q^N}$, which are different from the linear forms from $(F_q[t])^n$ to $F_q[t]$ given by
the same formula $\Psi_i(\mathbf{f}) = \sum_{j=1}^{n} L_{ij} f_j + b_i$ for every $\mathbf{f} = (f_1, \dots, f_n) \in (F_q[t])^n$. More precisely,
$\psi_i(\mathbf{f})$ is the residue of $\Psi_i(\mathbf{f})$ upon division by $f_N$, the irreducible polynomial underlying $F_{q^N}$.

To remedy this, let us divide $(G_N)^n \subset (F_q[t])^n$ into $q^{nM}$ "boxes" such that for $\mathbf{f}, \mathbf{f}'$ in the same box $B$,
we have $\max_j |f_j - f'_j| < q^{N-M}$ (in other words, each box is a product of cylinders of radius
$q^{N-M}$ in $F_q[t]$). Then for $\mathbf{f}, \mathbf{f}'$ in the same box, we have $|\Psi_i(\mathbf{f}) - \Psi_i(\mathbf{f}')| < q^{N-M+M} = q^N$. This
means that the quotients of $\Psi_i(\mathbf{f})$ and $\Psi_i(\mathbf{f}')$ upon division by $f_N$ are the same. In other words,
for any box $B$, we have a formula for $\psi_i(\mathbf{f})$:

$\psi_i(\mathbf{f}) = \sum_{j=1}^{n} L_{ij} f_j + b_{i,B}$

for every $\mathbf{f} \in B$, where $b_{i,B}$ depends only on the box $B$.
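
The box decomposition can be tried out on a toy example. The sketch below rests on assumptions that are not in the text: q = 2, N = 5, M = 2, n = 2, polynomials in F_2[t] stored as Python integers (bit i is the coefficient of t^i), and hypothetical coefficients `L` and `b`. It checks that the multiple of $f_N$ removed when reducing $\Psi(\mathbf{f})$ is the same throughout each box, which is what makes the constant term depend on the box alone.

```python
def deg(a):
    """Degree of a polynomial over F_2 stored as a bitmask; deg(0) = -1."""
    return a.bit_length() - 1

def mul(a, b):
    """Carry-less product, i.e. multiplication in F_2[t]."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        b >>= 1
    return result

def divmod_poly(a, m):
    """Quotient and remainder of a upon division by m in F_2[t]."""
    quotient = 0
    while deg(a) >= deg(m):
        shift = deg(a) - deg(m)
        quotient ^= 1 << shift
        a ^= m << shift
    return quotient, a

N, M, n = 5, 2, 2
F_N = 0b100101       # t^5 + t^2 + 1, irreducible over F_2
L = (0b10, 0b11)     # hypothetical coefficients t and t + 1, both of degree < M
b = 0b101            # hypothetical constant term t^2 + 1

quotients = {}
for f1 in range(1 << N):
    for f2 in range(1 << N):
        # A box fixes the top M coefficients of every coordinate.
        box = (f1 >> (N - M), f2 >> (N - M))
        Psi = mul(L[0], f1) ^ mul(L[1], f2) ^ b
        quo, rem = divmod_poly(Psi, F_N)
        quotients.setdefault(box, set()).add(quo)

number_of_boxes = len(quotients)
constant_on_boxes = all(len(seen) == 1 for seen in quotients.values())
print(number_of_boxes, constant_on_boxes)
```

Here there are $q^{nM} = 16$ boxes, and on each of them the reduction modulo $f_N$ subtracts one fixed polynomial, so the reduced form is again affine with the same coefficients $L_{ij}$.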



We now rewrite the expression (15) as



Sr) " ^ ^ {AdW^iMn + !)'••• Ai?(W^Vr«,i.(f) + l)'|f G i?) 

I / q Q 

Note that for $N$ sufficiently large, we have $\deg(L_{ij}) < M < w/2$. For each box $B$, Proposition
11 tells us that

E (Afi(iyVi,B(f) + !)'••• A«(T^Vn^,B(f) + l)'|f eB)= C{1 + 0(1)) (^^) " (^)™ 

Similarly to the calculations in the proof of the correlation condition, we see that $C = (\log q)^m$.
Summing this up over all the $q^{nM}$ boxes, the linear forms condition is therefore verified. □